(12) INTERNATIONAL APPLICATION PUBLISHED UNDEK THE PATEN I 1 COOPERATION TREATY (PCT) 

iiiiiiiniiiiiiiuiiiiiiiuiiiiiiiiiiiiii^^ 



(19) World Intellectual Property 
Organization 

International Bureau 




(43) International Publication Date 
29 January 2004 (29.01.2004) PCT 



(10) International Publication Number 

WO 2004/009814 Al 



(51) International Patent Classification 7 : 
C12Q 1/68 



C12N 15/10, 



(21) Internationa] Application Number: 

PCT/DK2O03/0OO49O 



(22) International Filing Date: 

(25) Filing Language: 

(26) Publication Language: 



11 July 2003(11.07.2003) 
English 
English 



(30) Priority Data: 
PA 2002 01 130 
60/399,156 



23 July 2002 (23.07.2002) DK 
30 July 2002 (30.07.2002) US 



(71) Applicant (for all designated States except US): N DEVO- 
LUTION A/S [DK/DKI; Ronnegade 8, 5ih floor, DK-2 1 00 
Copenhagen 0 (DK). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): PEDERSEN, 



Henrik [DK/DK]; Frodesvej 24, DK-2880 Bagsva:rd 
(DK). THISTED, Thomas [DK/DK]; Fjordskrrcnten 14, 
DK-3600 Frederikssund (DK). M0LLER, Hans, Thor- 
leif [DK/DK]; Dpckerslundsvej 6, 1, DK-5000 Odense C 
(DK). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA 7 BB, BO, BR, BY, BZ, CA, CM, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, R, GB, GD, GE, GH, 
GM, MR, MU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, HI, NO, NZ, OM, PG, PH, PL, PT, RO, RU, SC, 
SD, SE, SG, SK, SL, SY, TJ, TM, TO, TR, TT, TZ, UA, 
UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ; SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CM, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, I>T, RO, 

I Continued on next page} 



(54) Title: GENE S MUFFING BY TEMPLATE SWITCHING 



< 

00 
OS 



O 



Priming region 



Generation of templates 
For in vltio ua nscrtption- 



Priming region 



n-promotor ^ 



Priming region 



RNA synthesis 



Fragmentation 



Recombination by 
Forced copy- choice 



T7 RNA porymeraao 
transcription 



Rocombined sequences 



y 

r 


VY 


. Jfr 4 

" . 


r 


■ ' VT 


zz , 






1 Fragmentation 
I with iodine 


»s 


VY 


K , 


^ — 


n 


17. , 


M 


VY 


« , 






| Reverse transcription 




a 


_. 




n 


, A, 




.... rt . 




1 • 




> i 

i TJL t > 


LSL ; - 


, ,W.. 




iJU 


VY 






y* 


1 


,-fi. 


yv ■ 


a. 



(57) Abstract: The present invention provides a new 
approach to creating novel polynucleotide sequences 
by point mutation and recombination in vitro of a set of 
parental sequences. The new polynucleotide sequences 
can be useful in themselves or they can be used for 
the templated synthesis of polymers, e.g. polypeptides 
composed of cc-amino acids as occurring naturally in 
protein synthesis or polymers comprised of P-amino 
acids or other building blocks as described (Templated 
molecules). 
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Gene shuffling by template swithcing 

Field of the Invention 

5 The present invention relates to creating novel nucleic acid sequences by recombi- 
nation in vitro of a set of parental sequences. The parental sequences may for ex- 
ample be prepared by mutagenesis. The novel nucleic acid sequences may be use- 
ful per se or they may be utilised as templates for synthesis of polymers, e.g. poly- 
peptides composed of a-amino acids or polymers comprised of (3-amino acids or 
1 0 other building blocks. 

Background of the Invention 

1 5 Protein-engineering technology increasingly draws use of fundamental principles in 
natural biological evolution. The key processes in evolution are generation of a vari- 
ety of compounds and selection of favourable variants/features. Through genetic 
inheritance, selected traits are passed on to progenitors and again subjected to 
generation of variety. At the heart of evolution lies generation of mutations and re- 

20 combination events that are reflected in the activity of particular gene products con- 
ferring specific traits to an organism. It has now been widely recognized that the 
same processes can be mimicked in a test tube at the molecular level. 

In one version of in vitro evolution, variation is generated at the onset of the experi- 
25 ment and subsequently the desired molecules are selected through an iterative 
method of selection and amplification. For example, a randomized pool of nucleic 
acids are selected on basis of binding to a particular ligand and subsequently ampli- 
fied by the polymerase chain reaction. The same method may be applied for selec- 
tion of proteins provided that a physical link between the coding nucleic acid and the 
30 encoded protein exist. One method of linking the coding nucleic acid and its gene- 
product is by phage display, where the encoded protein is displayed on a phage that 
carries the coding information. Also methods for directly linking protein and its en- 
coding mRNA have been developed (Fitzgerald, K., Drug Discov Today 2000 
Jun;5(6):253-258). 
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Instead of or in addition to generating variation at the onset of the experiment, varia- 
tion may also be generated during the course of the experiment by employing muta- 
genic PCR. 

5 

Another in vitro evolution method that has proven extremely successful for protein 
engineering is the method of gene shuffling. In this method, normally the starting 
material is DNA sequences encoding homologues proteins. The starting material 
may be homologues genes from different organisms or a mutated pool of one par- 
1 0 ticular gene. Gene shuffling is the process of recombining the starting pool of se- 
quences to generate new gene-sequences that subsequently can be screened for 
particular desired characteristics. 

An artificial version of sexual reproduction was invented by Stemmer (US 
15 6,297,053). This method relies on homologous recombination during the PCR as- 
sembly of random gene fragments from multiple homologous parents, and gener- 
ates crossovers in regions of relatively high-level DNA homology and at loci of se- 
quence identity. Random cleavage of double-stranded DNA corresponding to the 
homologous parental sequences is most commonly achieved by partial DNase I 
20 digestion, but also fragmentation by uracil-DNA-glycosylase after incorporation of 
uracil in the parental DNA has been claimed. The fragmented DNA is subsequently 
used in a PCR reaction without external primers during which homologous se- 
quences will anneal and prime the synthesis of recombined DNA. To amplify full 
length DNA sequences, primers complementary to the 5'- and 3-ends are added in 
25 the last PCR cycles. 

A similar approach has been introduced by Arnold et al. (US 6,153,410). Their StEP 
(staggered-extension) protocol utilizes PCR with repeated cycles of denaturation 
and extremely short annealing/polymerase catalyzed extension times. Thus, short 
30 incompletely extended products are generated starting from primers annealing to 
the 5*- and 3*-ends of the parental genes. In each cycle the growing fragments can 
anneal to different templates based on sequence complementarity and extend fur- 
ther. The process is repeated until full-length gene is formed. Other similar formats 
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employing synthesis interruption ("stuttering") using chemicals or radiation has been 
claimed (US 5,965,408). 

Similarly, the protocol termed "Random priming in vitro recombination" (Shao et al., 
5 (1998), Nucleic Acids Res., 26, 681-683) involves priming double-stranded template 
polynucleotides with excess random-sequence primers and extending with poly- 
merase to generate a pool of short DNA fragments. The short DNA fragments can 
prime on one another based on sequence homology and be recombined and reas- 
sembled into full-length genes by repeated rounds of PGR. 

10 

The above mentioned shuffling formats all depend on a relatively high level of ho- 
mology as the fragmented parental sequences would otherwise not hybridize effi- 
ciently to each other during the assembly step. The Rachitt protocol (random chi- 
meragenesis on transient templates; Coco et al., (2001), Nature Biotechnology, 19, 

15 354-359) somewhat alleviates this problem reporting the shuffling of low homology 
parental gene sequences. Rachitt relies on the ordering, trimming and joining (by 
utilizing nucleases, gap fill-in and ligation) of randomly cleaved parental single- 
stranded DNA fragments of the same polarity annealed onto a transient complemen- 
tary polynucleotide template. This template is constituted by a full-length single- 

20 stranded homologous gene (differing from any of the fragmented parental genes), 
which serves as a scaffold for the process. The scaffold (carrying incorporated uracil 
nucleotides) is rendered non-amplifyable by uracil-DNA-glycosylase treatment, and 
the shuffled fragments are subsequently amplified by PCR. 

25 A further in vitro technique for shuffling heterologous sequences has been devel- 
oped by Borchert et al., 2001 (US 6,291,165). At least one conserved sequence 
region is identified wherein a high degree of homology between the (partially) het- 
erologous sequences exists. The conserved region(s) are then utilized as homolo- 
gous linking points for shuffling/recombining the parental sequences by SOE-PCR 

30 (sequence overlap extension PCR). This method yields only limited diversity in the 
resulting library, since the crossover point(s) are pre-defined by the experimental 
setup (i.e. PCR primer design. 
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Two shuffling methods for sequences with no homology has been reported. Itchy 
(Iterative truncation for the creation of hybrid enzymes) provides non-homologous 
recombination based on generation of N- and C-terminal fragment libraries of two 
genes by progressive truncation of the coding sequences with Exonuclease III fol- 
5 lowed by ligation of the products to make a "head-to-tail" single-crossover hybrid 
library (Ostermeier et al. f (1999), Nature Biotechnology 17, 1205-1209) . The draw- 
back of the Itchy protocol (library members containing only one crossover per gene) 
has been alleviated by the introduction of a method termed "Scratchy 0 . By first per- 
forming the Itchy protocol followed by standard DNA shuffling, multiple crossover 
10 DNA libraries independent of sequence identity can be created (Lutz et al., (2001), 
Proc. Natl. Acad ScL 98, 11248-11253). 

Additional recent reviews on the subject are: Kurtzman et al., Curr Opin Biotechnol 
2001 , 12, 361-370, Coco et al., Nat Biotechnol, 2001 , 19, 354-359 and Kolkman and 
15 Stemmer, Nat Biotechnol, 2001, 19, 423-428. 

A key process in the lifecycle of human immunodeficiency virus (HIV) virus is the 
conversion of the single-stranded RNA genome into a double-stranded DNA ge- 
nome. This process called reverse transcription is catalyzed by the virally encoded 

20 protein; reverse transcriptase. During reverse transcription, inter or intramolecular 
template switches may take place, a phenomenon known as copy-choice, template 
switching or strand transfer. Intramolecular strand transfer reactions are responsible 
for the generation of insertions and deletions, whereas intermolecular template 
switches eventually yield homologues recombination (Negroni and Buc, 2000, 

25 PNAS, 97, 6385-6390). 

Summary of the invention 

30 Prior art methods for gene shuffling involve hybridisation of product to template. Ac- 
cordingly, templates must be sufficiently long to allow specific hybridisation. 

The present invention provides novel methods of gene shuffling. The methods utilise 
template switching during template dependent nucleotide polymerisation to generate 
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novel sequences. The methods are not dependent on the length of templates and 
accordingly even very short templates may be shuffled according to the invention. 

In contrast to prior art shuffling methods, the individual input polynucleotide se- 
5 quences (templates) exploited in the present invention are all single-stranded and of 
the same polarity. The decoding of all of the nucleotide templates thus occurs in the 
same orientation. 

Accordingly, template switched products result from polymerase directed, uni- 
10 directional decoding of a plurality of templates, such as two templates, or more than 
two templates. 

Without being bound by theory, template switched products are obtained according 
to one presently preferred hypothesis by decoding at least two separate nucleotide 

15 templates, thereby providing a single, continuous template switched product. The 

template switched product is obtained in one embodiment by firstly decoding at least 
part of a first template (in the 3'-5' orientation with respect to the first template), and 
subsequently decoding at least part af at least one second template having the 
same polarity (i.e. 3'-5' orientation with respect to the at least one second template), 

20 wherein said decoding process results in the formation of a single, continuously 

growing template switched product, essentially without giving rise to any other prod- 
ucts, such as intermediate products capable of hybridising or annealing to one an- 
other. 

25 The templates according to the invention are single stranded and all of the tem- 
plates have the same polarity and are thus all capable of being decoded in the same 
direction. A short oligonucleotide primer (typically 15-20 nucleotides) complementary 
to sequences in the 3'-end of at least one template can be used for initiation of en- 
zymatic nucleotide polymerization (e.g. reverse transcription). Alternatively, the at 

30 least one template can contain a hair-pin loop-like structure in the 3'-end resulting 

from part of the template folding back on itself. Such a structure can also be used for 
priming. However, without being bound be theory, nucleotide polymerization does 
not necessarily require any of the aforementioned alternatives. 
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It is a first objective of the present invention to provide methods of preparing a 
template switched product, or a plurality of different template switched products, 
wherein said product(s) is encoded by at least part of one first template and at least 
part of at least one second template, wherein said template switched product 
5 comprises at least one predetermined property, said method comprising the steps o 



providing one first and at least one second template molecule; and 



ii) providing a nucleic acid polymerase; and 

10 



Hi) synthesising a plurality of different template switched products by 

contacting sequentially in any order, or simultaneously, at least part of the 
first template and at least part of the at least one second template with 
said polymerase under conditions allowing for template dependent 
nucleotide polymerisation, 

wherein the synthesis of each individual template switched product 
involves at least one template switch, 

and wherein the synthesis of the plurality of different template switched 
products involves a plurality of template switches, 



iv) separating at least one template switched product comprising the at least 
one predetermined property from said plurality of template switched 
products; and 

v) obtaining a template switched product comprising at least one 
predetermined property 

Different template switched products are obtained when a polymerase is switching 
from one location on a first template to different locations on a second template. The 
template switched product will thus contain smaller or larger parts of the first and 
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second template, respectively, depending on the exact position/location where the 
template switch occurred. 

The template switched product can contain either a part of the first template, or the 
5 entire first template (in which case the template switching has occurred at the very 
end of the first template), as well as part of the second template, or the entire 
second template (for the same reason as stated above). However, template 
switching involving a first template and a second template can also produce 
template switched products comprising multiple fragments in any order of said first 
0 and/or second template, but retaining the same polarity/orientation as in the first 

and/or second template. Such template switched products will result from e.g. more 
than one template switch between a first and a second template. In one such 
example the polymerase starts by decoding a first template, switches to a second 
template, and switches back to a first template. As more copies of the first and 
5 second template can be present, the polymerase does not need to switch back to 
the same identical template which was initially decoded. Switching to another copy 
of the first template will also result in a continuous, template switched product 
comprising (in this order) part of the first template linked to part of the second 
template linked to part of the first template. 

!0 

As e.g. "the first template" as used herein above merely indicate any one of a 
number of identical copies of first templates, template switching between a first 
template and at least one second template shall be understood to include template 
switching among any copy of such templates present in the reaction mixture. This is 

25 to say that template switching during template switched product synthesis can occur 
once between the same two template copies, or more than once between the same 
two template copies. Irrespective of which of the aforementioned two alternatives 
take place (one or more template switches between the same two templates), 
template switched product synthesis can additionally involve template switches also 

30 involving the same or different third and fourth templates, wherein one or more 
template switches can occur between said third and fourth templates. 

In summary, different template switched products according to the present invention 
can be generated by template switching involving two templates and at least one 
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template switch, such as two template switches involving the same templates 
(identical templates, or identical copies of said templates), for example three 
template switches, Alternatively or in addition to the aforementioned, different 
template switched products according to the present invention can be generated by 
5 template switching involving more than two templates, wherein all of said templates 
are involved in at least one template switch, i.e. the polymerase either starts 
decoding said template, or switches to said template during template switched 
product synthesis. 

0 It is an objective of the present invention to provide uses of the above-mentioned 
methods for gene shuffling. 

In one embodiment of the present invention the nucleic acid polymerase is a reverse 
transcriptase derived from HIV-1 (human immunodeficiency virus-1). It has sur- 
15 pringsingly been demonstrated by the inventors that the inherent feature of HIV-1 
reverse transcriptase for recombining homologues sequences (Peliska and Benk- 
ovic, Biochemistry, 1994, 33, 3890-3895; Peliska and Benkovic, Science, 1992, 258, 
1112-1 118), may be employed for gene shuffling. 

20 

Brief Description of the Figures 

Fig. 1 illustrates one example of gene shuffling by means of template switched 
product synthesis. Target sequences can be PCR amplified and this creates DNA 

25 templates for RNA synthesis in vitro. Naturally occuring polymerases or poly- 
merases modified by recombinant gene technology or protein engineering can be 
used, including Reverse Transcriptase and T7 RNA polymerase, as well as other 
template switching polymerases. When it is desirable to facilitate fragmentation with 
iodine, phosphorothioate ribonucleotide analogues can be incorporated. After frag- 

30 mentation, the sequences to be shuffled can be mixed, subjected to template 

switching, or recombined. Variations in sequence for the two sequences are indi- 
cated by letters in upper and lower case, respectively. 
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Fig. 2 illustrates principal differences between conventional shuffling methods (illus- 
trated by the shuffling protocol in pane! A) and the present invention (panel B). 

(A) In conventional shuffling protocols, sequences of both polarities (in this case 
5 double-stranded DNA) are used for input material. The arrow pointing back to the 

denaturation and annealing step indicates the cyclic nature of the process where 
repeated cycles of denaturation, annealing to new complementary strands and ex- 
tension w. Taq polymerase is required for PCR based assembly of the gene frag- 
ments into the full-length product. 

10 

(B) In the present invention all input sequences are single stranded and of the same 
polarity. In this example, random fragmentation of the input RNA causes template 
switching by reverse transcriptase resulting in the synthesis of the template switched 
product carrying a recombination of the sequences present in the two input tem- 

15 plates. In the present example, reverse transcription is initiated from a short hairpin 
structure in the 3'-end of the templates. Reverse transcription can also be initiated 
from a short (typically 15-20 nt) DNA oligonucleotide annealed to the very 3'-end of 
the templates. 

20 The polarity of the polynucleotide templates (5' to 3' direction) is indicated by arrow- 
heads. Variation in sequence between the parental input sequences are indicated 
by letters in upper and lower case, respectively. Reverse transcriptase is indicated 
by a cross-hatched oval. 

25 

Definitions 

Activity 

The term "activity" refers to any specific activity that a molecule is capable of per- 
30 forming or encode. For example activity may be that a molecule is capable of asso- 
ciating with a specific binding partner with a specific affinity, capable of catalysing a 
specific reaction, capable of inhibiting a specific reaction, capable of effecting a par- 
ticular cellular response, capable of interacting with a physical surface. 
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According to the present invention a first activity is usually associated with a first 
template or a product encoded by said first template, and a second activity is usually 
associated with a second template or a product encoded by said second template. 

5 

A third activity is usually associated with a template switched product or a product 
encoded by a template switched product. A third activity is said to be different to a 
first activity or a second activity when if its different activity can be determined by 
using any assay capable of demonstrating physical or functional activities associ- 
10 ated with any of the aforementioned templates. By way of example, association with 
a specific binding partner with a higher or lower affinity or specificity is a different 
activity. 

Conditions allowing for template dependent nucleotide polymerisation 

1 5 "Conditions allowing for template dependent nucleotide polymerisation" are any 

conditions under which a nucleic acid polymerase is capable of catalysing template 
dependent nucleotide polymerisation. 

The specific conditions will depend on the specific nucleic acid polymerase. In gen- 
20 eral however, conditions allowing for template dependent nucleotide polymerisation 
comprises nucleotides, buffer and salt(s). Furthermore, the specific conditions may 
involve incubation at a specific temperature. 

Decoding 

25 

Decoding is the process whereby the information content of one nucleotide template 
is transferred by means of a nucleic acid polymerase to another template, or to a 
biomolecule. See "Encoding". 

30 Encoding 



A nucleic acid or oligonucleotide analog is said to encode another molecule, if the 
sequence of said nucleic acid or oligonucleotide analog specifies the sequence or 
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composition of said other molecule. Examples of encoding are: i) RNA encodes a 
DNA when a RNA molecule is reverse transcribed by e.g. a reverse transcriptase to 
form a DNA strand whose sequence is complementary to the RNA; ii) mRNA en- 
codes a peptide when the genetic code of the mRNA is translated into a peptide by 
5 a ribosome, where the sequence of the peptide is determined by the sequence of 
triplet codons of the mRNA; iii) DNA encodes a polymer or scaffolded molecule, 
when the single stranded DNA is first replicated using nucleotide and/or nucleotide 
derivatives and a polymerase, whereafter appendices attached to the nucleo- 
tides/nucleotide derivatives are covalently linked to form a polymer or scaffolded 
1 0 molecule, where the sequence or composition of the polymer or scaffolded mole- 
cule, respectively, is determined by the sequence of the template DNA strand. 

Gene shuffling 

Gene shuffling is the process wherein one or more nucleic acids are produced com- 
15 prising sequences derived from two or more different starting nucleic acids. Gene 

shuffling may for example be used to obtain novel nucleic acids with specific activity. 

Nucleic acid polymerase 

A nucleic acid polymerase is a catalyst, e.g. an enzyme or ribozyme, capable of 
20 catalysing template dependent polymerisation of nucleotides. 

Preferred nucleic acid polymerases according to the invention are reverse transcrip- 
tases, i.e. enzymes capable of catalysing polymerisation of deoxynucleotides using 
a ribonucleic acid as template. 

25 

In one preferred aspect of the present invention the reverse transcriptase is derived 
from a human immunodeficiency virus (HIV). 

Nucleotides 

30 Nucleotides according to the invention includes ribonucleotides comprising a nu- 

cleobase selected from the group consisting of adenine (A), uracil (U), guanine (G), 
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and cytosine (C), and deoxyribonucleotide comprising a nucleobase selected from 
the group consisting of adenine (A), thymine (T), guanine (G), and cytosine (C). 

Nucleobases are capable of associating specifically with one or more other nucleo- 
5 bases via hydrogen bonds. Thus it is an important feature of a nucleobase that it 

can only form stable hydrogen bonds with one or a few other nucleobases, but that it 
can not form stable hydrogen bonds with most other nucleobases usually including 
itself. The specific interaction of one nucleobase with another nucleobase is gener- 
ally termed "base-pairing". 

0 

The base pairing results in a specific hybridisation between predetermined and 
complementary nucleotides. Complementary nucleotides according to the present 
invention are nucleotides that comprise nucleobases that are capable of base- 
pairing. 

5 

Of the naturally occurring nucleobases adenine (A) pairs with thymine (T) or uracil 
(U); and guanine (G) pairs with cytosine (C). Accordingly, e.g. a nucleotide 
comprising A is complementary to a nucleotide comprising either T or U, and a 
nucleotide comprising G is complementary to a nucleotide comprising C. 



Nucleotide analogues 

Nucleotide analogues include any nucleotide analogues capable of specific bas- 
pairing, for example nucleotide analogues may be derivatives of naturally occurring 
nucleotides, or they may comprise a different backbone or different base moiety 
than naturally occurring nucleotides. Nucleotide analogues according to the present 
invention are capable of specific hybridisation with at least one other nucleotide; and 
association with the active site of at least one nucleic acid polymerase. 



Examples of nucleotide analogues according to the invention includes but are not 
limited to PNA, LNA, dideoxynucleotides, deoxyribonucleotides, and nucleotides 
containing unnatural bases. 
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Predetermined property 

The predetermined property of a template switched product may be any desirable 
property. For example the predetermined property may be specific association with 
5 a specific binding partner or the property may be that the template switched product 
encodes another product with a predetermined activity. 

Scaffolded molecule 

10 A scaffolded molecule according to the present invention is a molecule comprising a 
scaffold to which more than 2 chemical entities are attached. Said chemical entities 
may for example be monomer units capable of constituting part of a polymer. 

A scaffold within the meaning of the present invention is a molecular moiety to which 
15 more than 2 chemical entities may be directly attached. Hence a scaffold preferably 
comprises more than two reactive groups. 

Template 

20 The template is preferably a single strand of nucleotides or nucleotide analogs. 
When the template comprises a strand of nucleotides, the nucleotides may be 
natural or non-natural, and may be linked by e.g. phosphorothioate bonds or natural 
phosphodiester bonds. Nucleotide analogs may be linked e.g. by amide bonds, 
peptide bonds, or any equivalent means capable of linking nucleotide analogs. The 

25 sugar moiety of a nucleotide or nucleotide analog may be a ribose or a deoxyribose, 
a ribose derivative, or any other molecular moiety. 

A template comprises a sequence of n coding elements, wherein n is an integer of 
more than 5. A coding element is an element that directs insertion of a specific 
30 group, for example a specific nucleotide or nucleotide analogue into the product. 

In one preferred embodiment of the present invention, the template is a nucleic acid 
or a nucleic acid analogue. More preferably, the template is a nucleic acid, which is 
capable of being transcribed by a polymerase comprising RNase H activity, 
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including a reverse transcriptase. Accordingly, it is preferred that the starting 
template comprises or consists of RNA or a derivative or analogue thereof. The 
methods according to the invention may involve use of more than one template, for 
example the method may involve the use of a first and a second template. 

5 

Template Switch 

Template switching occurs when a nucleic acid polymerase which is synthesizing a 
templated product (template switched product) by decoding at least part of a first 

10 template, shifts from said first template to a second template, which is subsequently 
at least partly decoded during the synthesis of the remaining part of said templated 
product. It is possible for the polymerase to decode (from the location of the switch) 
the entire remaining part of the second template, to decode part of said remaining 
part of said second template and, in the latter case, to switch template again to e.g. 

1 5 any position of said first template (or an identical copy thereof), or switch to any 
other template. 

The templated product is thus synthesized by a combination of nucleic acid poly- 
merase directed decoding of at least part of said first template and nucleic acid po- 

20 lymerase directed decoding of at least part of said second template. Template 

switching and synthesis of a template switched product according to the invention 
involves the same nucleic acid polymerase (unlike PCR where different poly- 
merases are used to decode templates having different polarity) and the synthesis 
of a template switched product is one continuous process that does not involve de- 

25 naturation and renaturation of double stranded nucleic acids (unlike PCR). 

Template switch signal 

A template switch signal is any modification of a template, capable of inducing a 
30 higher rate of template switches of a nucleic acid polymerase. A template switch 

signal can be introduced into a template simultaneously with synthesis of said tem- 
plate or the template switch signal may be introduced after synthesis of the tern- 
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plate. Preferably, the template switch signal is a signal capable of introducing a 
pause in nucleotide polymerisation catalysed by a nucleic acid polymerase. 

Template switched product 

5 

A template switched product according to the present invention is the product of a 
process involving contacting sequentially in any order or simultaneously at least part 
of a first template and at least part of a second template with a nucleic acid poly- 
merase under conditions allowing for template dependent nucleotide polymerisation, 

10 wherein the synthesis of each individual template switched product involves at least 
one template switch. The template switched product is synthesised in a continuous 
process as the length of the template switched product strand increases during de- 
coding of (initially) the first template, or a part thereof, and (subsequently) during 
decoding of the second template, or a part thereof. Thus, it is possible to regard the 

1 5 template switched product as a chimera comprising first template sequence (or a 
one to one relationship with such a sequence), and second template sequence, (or 
a one to one relationship with such a sequence). 

The template switched product is preferably a nucleic acid. Preferably, the template 
20 switched product is a nucleic acid, which may be the product of a reverse 

transcription reaction. Accordingly, it is preferred that the template switched product 
comprises or consists of DNA, or a derivative or analogue thereof 

25 Detailed Description of the Invention 

Methods of preparing template switched products 

The present invention relates to methods of preparing a template switched product 
30 encoded by at least part of one first and part of one second template, wherein said 
product comprises at least one predetermined property, said method comprising the 
steps of 
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i) providing at least one first and at least one second template molecule; 
and 



ii) providing a nucleic acid polymerase; and 



iii) synthesising a plurality of different template switched products by 

contacting sequentially in any order or simultaneously at least part of the 
10 first template and at least part of the at least one second template with 

said polymerase under conditions allowing for template dependent 
nucleotide polymerisation, 

wherein the synthesis of each individual template switched product 
15 involves at least one template switch, and 

wherein the synthesis of the plurality of different template switched 
products involves a plurality of template switches, 

20 iv) separating at least one template switched product comprising the at least 

one predetermined property from said plurality of template switched 
products; and 

v) obtaining a template switched product comprising at least one 
25 predetermined property 

Such methods may be useful for several purposes, however, the methods are in 
particular useful for gene shuffling. During sexual reproduction homologous 
recombination contributes to the continous evolution of the genome. Gene shuffling 
30 is an vitro process mimicking natural evolution, by creating novel nucleic acid 

sequences by "shuffling" two or more template nucleic acid sequences. Subsequent 
to shuffling of the sequences, sequences comprising desired properties can be 
selected. 
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The individual steps of the methods may be performed more than once. In 
particular, it is frequently preferred that the entire sequence of steps are repeated at 
least once. For example the sequence of steps may be repeated once, such as 
twice, for example 3 times, such as 4 times, for example 5 times, such as in the 
5 range of 5 to 10 times, for example in the range of 10 to 20 times, such as in the 
range of 20 to 50 times, for example more than 50 times. 

In particular, when the method is repeated, it is preferred that the template switched 
products comprising at least one predetermined property obtained by performed the 

1 0 method the first time are used as templates, such as first templates, second 

templates and/or further templates when the method is repeated. Hence, template 
switched product(s) may be used as a first template and/or second template when 
the method is repeated. Alternatively, it is preferred that the method is repeated 
using templates encoded by the template switched products comprising 

15 predetermined property(ies) obtained by performing the method the first time. 

By way of example, if the templates are RNA molecules and the template switched 
products are DNA molecules, novel templates may be synthesised by transcription 
of said DNA molecules. 

20 

By using the template switched products obtained from the method as templates for 
subsequent rounds of the method, it is possible to obtain template switched 
products comprising improved predetermined property(ies). 

25 It should be noted that conditions during any of the steps in the method employing 
template directed polynucleotide synthesis can be modulated as to decrease the 
fidelity of polynucleotide synthesis. In some cases the added sequence diversity 
generated by error-prone conditions can be advantageous. This also allows for se- 
quence evolution during repeated rounds of the method, such that a resulting 

30 template switched product can contain sequence information (e.g. mutations) not 
present in the input templates used at the beginning of the method. 



In one preferred embodiment the method furthermore comprises the step of 
introducing one or more template switch signals into the first template, and/or the 
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second template and/or any further templates. Preferably said step is performed 
subsequent to step i) and prior to step hi). Examples of suitable template switch 
signals are given herein below. 

5 Prior art gene shuffling methods may require a heat denaturation / renaturation type 
of annealing between the template(s) and the generated, recombined product(s). It 
is one advantage of the present invention, that it is not required that the template 
switched product(s) anneals in this way (heat denaturation / renaturation type of 
annealing) to any of the templates. 

10 

Hence, in one embodiment the methods described herein can involve annealing 
between the first template and the template switched product, but said methods do 
not need to comprise such a feature. Furthermore, preferably the methods do not 
require and preferably do not involve annealing between the second template and/or 
15 any further template and the template switched product. 

In one preferred embodiment of the invention the synthesis of template switched 
products involves a reverse transcription reaction. The templates (first template, 
second templates and any further templates) for a reverse transcription reaction are 
20 RNA molecules or analogues thereof. A reverse transcription reaction may for ex- 
ample be performed as follows: 

RNA templates are mixed in appropriate ratios, a reverse primer is annealed to the 
template RNAs and reverse transcription is initiated by the addition of reverse tran- 
25 scriptase, such as HIV reverse transcriptase. After reverse transcription, which 

should involve at least one template switch, in general a PCR is performed on the 
produced ssDNA (=template switched products) to facilitate screening or selection 
for a desired property of the template switched products or to generate templates for 
another round of synthesis of template switched products. 

30 

Suitable reverse primers includes any primer capable of annealing to the template 
RNA. 



Templates 
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Templates to be used with the present invention may be any template according to 
the definition herein above. Preferably the template is a polynucleotide comprising 
or consisting of nucleic acids and/or nucleic acid analogues. 

The methods described herein involve the use of at least a first template and at least 
one second template. However, the methods can involve the use of more than 2 
different templates, for example 3, such as in the range of 3 to 5, for example 5 to 
10, such as 10 to 20, for example 20 to 50, such as 50 to 100, for example 100 to 
250, such as 250 to 500, for example 500 to 1000, such as more than 1000 different 
templates. 

The methods can involve template switching between 2 different templates, or more 
than 2 different templates, for example 3, such as in the range of 3 to 5, for example 
5 to 10, such as 10 to 20, for example 20 to 50, such as 50 to 100, for example 100 
to 250, such as 250 to 500, for example 500 to 1000, such as template switching 
between more than 1000 different templates. 

The method in yet another embodiment method involves more than 2 different 
template switches, such as 3, for example 4, such as 5, for example in the range 
from 5 to 10, such as from 10 to 20, for example from 20 to 50, such as from 50 to 
100, for example from 100 to 200, such as more than 200 different template 
switches. 

One or more of the template switches of the present method are different or unique 
switches involving the formation of different or unique template switched products. It 
is to be understood that such products are the result of at least one switch from a 
predetermined position of a first template to a different position of at least one 
second or further template, or the result of at least one switch from a different 
position of a first template to a predetermined position of at least one second or 
further template, or the result of at least one switch from a different position of a first 
template to a different position of at least one second or further template, wherein 
switches to or from different positions are unique for the method, and wherein the 
method can involve at least one other switch to or from said predetermined 
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position(s). Accordingly, each different template switched product of the invention is 
the result of at least one unique template switching event, i.e. a template switching 
event being different from other template switching event(s), thus generating a 
unique boundary in the continuous, template switched product between sequence 
5 originating from the first template and sequence originating from the at least one 
second template, or a unique boundary between sequence originating from the at 
least one second template and any further template. 

In one embodiment the number of different template switched products generated is 
10 preferably more than 20, for example more than 50, such as more than 500, for 

example more than 1000, such as more than 10000, for example more than 100000, 
such as more than 1000000. In another embodiment the number of different 
template switched products generated is preferably in the range of from 1 to 
1000000, such as from 1 to 500000, for example from 5 to 500000, such as from 5 
1 5 to 250000, for example from 1 0 to 200000, such as from 1 0 to 1 50000, for example 
from 15 to 100000, such as from 5 to 250000, for example from 10 to 200000, such 
as from 10 to 150000, for example from 15 to 100000, such as from 20 to 200000, 
for example from 20 to 150000, such as from 25 to 1 50000, for example from 25 to 
125000, such as from 25 to 100000, for example from 50 to 100000, such as from 
20 50 to 50000, for example from 100 to 50000. 

It is thus within the scope of the present invention to provide in a single reaction 
mixture more than 10 different templates and to obtain more than 100 different 
template switched products. 

25 

In a preferred embodiment of the present invention the first template, the second 
template and any further template are individually selected from the group consisting 
of nucleic acid molecules and analogues thereof. 

30 The first template, the second template and any further template may pair-wise be 
identical, similar or different. Preferably, however the first, the second and any 
further templates are pair-wise similar to one another, but not identical. For example, 
the first, the second and any further template may pair-wise individually share in the 
range of 10 to 99%, such as 20 to 99%, for example 30 to 99%, such as 40 to 99%, 
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for example 50 to 99%, such as 60 to 99%, for example 70 to 99%, such as 80 to 
99%, for example 90 to 99%, such as 95 to 99%, for example 30 to 95%, such as 40 
to 95%, for example 50 to 95%, such as 60 to 95%, for example 70 to 95%, such as 
80 to 95%, such as 90 to 95%, for example 40 to 90%, such as 50 to 90%, for 
5 example 60 to 90%, such as 70 to 90%, for example 80 to 90% identity. 

Furthermore, it is possible that the first, the second and any further template may 
pair-wise individually share a region comprising 2, for example 3, such as 4, such as 
5, such as 6, for example 7, such as 8, for example 9, such as 10, for example 1 1 , 
10 such as 12, for example 13, such as 14, for example 15 nucleotides which share at 
least 40%, such as at least 50%, for example at least 60%, such as at least 70%, for 
example at least 80%, such as at least 90%, for example at least 95%, such as at 
99%, for example around 100% identity. 

1 5 Hence, for example a region comprising 5 nucleotides in the first template may 

share at least 40%, such as at least 60%, for example at least 80%, such as around 
100% identity with a region comprising 5 nucleotides in the second template. 

Preferably, templates according to the invention are any nucleic acid or nucleic acid 
20 analogue capable of being replicated or transcribed by a nucleic acid polymerase. 
Hence, depending of the nucleic acid polymerase that may be useful for the 
particular embodiment of the invention, the template should preferably be selected 
so that it may serve as template for that particular nucleic acid polymerase. 

25 In one preferred embodiment of the invention the nucleic acid polymerase is a 
reverse transcriptase, more preferably HIV reverse transcriptase. Hence, in that 
embodiment of the invention it is preferred that the first template, the second 
template and any other templates are selected from the group consisting of 
templates that are capable of being template for reverse transcriptase. 

30 

Accordingly, in that embodiment it is preferred that the first template and/or the 
second template and/or any further templates are RNA molecules or RNA 
analogues. 
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In one embodiment of the invention the first template and/or the second template 
and/or any further templates encodes a polymer. Said polymer may be any of the 
polymers described herein below. 

5 In another embodiment of the invention the first template and/or the second tem- 
plate and/or any further templates encodes a scaffolded molecule. Said scaffolded 
molecule may be any of the scaffolded molecules described herein below. 

Template switched product 

10 

A template switched product according to the present invention is a product of a 
method involving contacting at least part of a first template and at least part of a 
second template with a nucleic acid polymerase under conditions allowing for 
template dependent nucleotide polymerisation, wherein the synthesis of each 
15 individual template switched product involves at least one template switch. 

The first and the second template may be contacted by the polymerase 
simultaneously or sequentially in any order. When performing such a method, 
usually a plurality of different template switched products are generated, wherein the 
20 synthesis of each template switched product involves one or more template 

switches, which are different to the template switches in the synthesis of different 
template switched products. 

Hence, the template switched products according to the invention are products of a 
25 nucleic acid polymerase catalysed template dependent nucleotide polymerisation. 
Accordingly, one or more template switched products according to the present 
invention comprises or consists of nucleic acids or nucleic acids analogues. 
Preferably, all template switched products comprises or consists of nucleic acids or 
nucleic acids analogues. 

30 

In one embodiment of the invention at least one, but preferably all template switched 
products are nucleic acids. 
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The present invention also relates to separating at least one template switched 
product with a predetermined property from the plurality of template switched 
products synthesised. 

5 Said predetermined property may be an intrinsic activity of the template switched 
product or the predetermined property may be that the template switched product 
encodes a molecule comprising a desired activity. 

For example, the template switched product may be a nucleic acid or nucleic acid 
10 analogue with catalytic properties, for example the template switched product may 
comprise or consist of a ribozyme. Alternatively, the template switched product may 
be a nucleic acid or nucleic acid analogue capable of interfering with transcription 
and/or translation of one or more nucleic acids, for example the template switched 
product may be an antisense or antigene nucleic acid or nucleic acid analogue, 

15 

Frequently, the template switched product according to the present invention 
encodes another molecule. For example, the template switched product may 
encode a polymer, such as any of the polymers described herein below. "Encode" 
should be understood to designate the process in which the sequence of a nucleic 
20 acid or nucleic acid analogue is translated or transcribed/translated into a molecule, 
such as a polypeptide by one or more enzymatically catalysed processes. Hence, 
for example the template switched product may encode a polypeptide. 

It is also comprised within the present invention that the template switched product 
25 may encode a molecule, which is not a linear polymer, for example the template 

switched product may encode a scaffolded molecule, such as any of the scaffolded 
molecules described herein below. 

It is also comprised within the scope of the present invention, that the template 
30 switched product may be amplified after synthesis. For example, the methods 

described herein may comprise amplification of the plurality of template switched 
products. Said amplication may be performed prior to separating template switched 
products comprising a specific predetermined property from the plurality of template 
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switched products. However, it is also possible that only the template switched 
products comprising a specific predetermined property are amplified. 

Amplification may be performed by a number of different methods that should be 
5 selected according to the nature of the template switched product. If the template 

switched products comprises or consists of DNA they may for example be amplified 
by polymerase chain reaction (PCR). If the template switched products comprises or 
consists of RNA, they may for example be amplified by a reverse-transcription PCR. 

10 Although the methods according to the invention involves the use of at least 2 

different templates, however, it is possible to use more than 2 templates. A large 
number of different starting templates in general allows for the synthesis of a large 
variety of different template switched products. Hence, if it is preferred to generate a 
large number of different template switched products, preferably a large number of 

1 5 different templates may be employed. 

It is also possible to e.g. shuffle large polynucleotide fragments by providing polynu- 
cleotide fragments which are at least pairwise overlapping. Accordingly, it is compri- 
sed within the present invention that multiple templates containing partially overlap- 

20 ping sequences can be employed resulting in the formation of a template switched 
product emerging from multiple template switching events between such templates, 
wherein said template switched products are longer than the length of each of the 
individual input templates, but shorter than the sum or combined length of the indivi- 
dual input templates. For example, multiple shorter and partly overlapping templates 

25 can be designed and synthesized which each encode only part of a given full-length 
polynucleotide sequence, and using such a plurality of partly overlapping templates 
results in the formation of a template switch product. The template switch product 
results from decoding at least part of each of the templates. Each of the above- 
mentioned shorter templates can (at least in part) correspond to functional regions 

30 or domains of the encoded product, (e.g. a sequence specifying the nature of a 

chemical entity of a scaffolded molecule, protein structural or functional domains or 
epitopes), exons of a gene sequence, or the shorter templates can correspond to 
arbitrarily chosen segments of the full-length polynucleotide sequence. 
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It should be understood that each of said above-mentioned shorter templates can be 
a population of homologous templates, for example differing from one another by 
carrying various nucleotide substitutions, deletions or additions. Such templates 
carrying substitutions, deletions and/or additions are termed "functional homolo- 
5 gues", "functional equivalents" or "varients 8 and examples of substitutions, deletions 
and/or additions are described herein elsewhere in more detail. It will be understood 
that a variant or functional equivalent can be identified for both the template, or al- 
ternatively, for the biological functionality and/or activity encoded by said template or 
the resulting template switched product generated when exploiting functional homo- 
10 logue templates. 

For example, the methods according to the invention may involve the use of more 
than 2 different templates, such as 3, for example 4, such as 5, for example in the 
range from 5 to 10, such as from 10 to 20, for example from 20 to 50, such as from 
15 50 to 100, for example from 100 to 200, such as more than 200 different templates. 

In some embodiments of the invention, one or more of the templates comprises at 
least one template switch signal. Said template switch signal may be inserted into 
the template during synthesis of the template or subsequent to synthesis of the 
20 template. Examples of useful template switchs signals are given herein below. 

The templates to be employed with the present invention may be synthesised by 
any suitable method known to the person skilled in the art. The method to be used is 
dependent on the kind of template. 

25 

In one preferred embodiment of the invention the template comprises RNA or an 
RNA analogue. 

RNA may be synthesised in vitro or in vivo. For example RNA templates for reverse 
30 transcription may be directly purified from cell extracts or chemically synthesised. 
Preferably RNA templates are synthesised using in vitro transcription reactions, in 
which a double-stranded DNA template directs the synthesis of single-stranded 
RNA. Suitable in vitro transcription reactions are known to the person skilled in the 
art. 
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Double-stranded DNA templates to be used in transcription reactions may for exam- 
ple be generated by PCR. Primers used in the PCR reaction are designed to flank 
the target sequences that are to be used as first templates/second templates or fur- 
5 ther templates in the methods of the invention. One primer contains a 5' extension 
that contains the promoter sequence for a particular RNA polymerase or a functional 
homologue thereof, e.g. T7 RNA polymerase, such that the promoter will be fused to 
target sequences in the amplified product. Target sequences may be any DNA and 
RNA sequences that can be amplified by PCR or RT-PCR. DNA-templates may 
10 also be chemically synthesized or be produced by cloning techniques fusing a pro- 
moter to target sequences. 

Once a suitable double-stranded DNA template is available RNA synthesis may be 
performed using an in vitro transcription reaction. Double-stranded DNA templates, 
15 ribonucleotides and RNA polymerase is mixed in reaction buffer and incubated un- 
der conditions allowing for transcription to proceed. The exact conditions depends 
on the particular RNA polymerase and may for example include incubation at 37°C. 

It should be noted that transcription can be carried out using reagents or conditions 
20 affecting the fidelity of the process which in some experimental settings could prove 
useful in generating added diversity to the input sequences. For instance, a T7 RNA 
polymerase mutant with increased nucleotide substitution error rate has been repor- 
ted (Brakmann & Grzeszik, 2001, Chembiochem 2, 212-219). 

25 It is also comprised within the present invention that one or more templates are 

synthesised chemically by conventional method. For example, when the template is 
an RNA molecule, said RNA molecule may be chemically synthesised. However, in 
particular when the template is a nucleic acid analogue or comprises nucleotide 
analogues it is frequently preferred, that such templates are synthesised chemically. 

30 

In some embodiments the template may be synthesised in vivo. In particular when 
the template is a naturally occurring nucleic acid, it is frequently useful that the 
template is synthesised in vivo. For example the template may be directly purified 
from a cell extract, i.e. when the template is an RNA or a DNA molecule said RNA or 
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DNA molecule may be purified from a cell extract by any conventional method 
known to the person skilled in the art. 

The cell extract, may be prepared from a number of different cells depending on the 
5 specific embodiment, such as but not limited to mammalian cells including human 
cells, insect cells, plant cells, bacteria, fungi or yeast, 

i n preferred embodiments of the invention, one or more templates comprises at 
least one template switch signal. Template switch signals are described in more 
10 detail herein below. One kind of template switch signal that may be employed with 
the present invention is a breakage of the template. If the template is an RNA 
molecule, said break may for example be inserted directly into the RNA molecule. 

In order to generate the maximal diversity breaks may be introduced randomly in a 
15 variety of positions, hence generating a number of different DNA or RNA molecules 
of various length, that each may be transcribed or replicated to generate template. 
In another embodiment of the invention one or more templates have been prepared 
by a method comprising the steps of 



20 i) providing a RNA molecule; and 

ii) fragmenting said RNA molecule into RNA fragments; and 

Hi) replicating one or more of said RNA fragments; and 

iv) thereby obtaining one or more different templates 



25 Fragmentation may be performed by any method available to the person skilled in 
the art. For example the DNA or RNA may be fragmented by enzymatic digestion, 
by limited enzymatic digestion or by limited hydrolysis. The enzymatic digestion may 
be performed by any suitable nuclease, such as an endonuclease. In case of DNA 
the endonuclease may for example be a restriction enzyme or DNase I, in case of 

30 RNA the endonuclease may for example be an RNase. 



Furthermore, it is possible to incorporate phosphorothioate analogues of ribonucleo- 
tide during in vitro transcription, which will allow controlled 5' (prime) to 3' (prime) 
exonucleolytic truncation of RNAs as exonuclease stalls at a phoshorothioester. 
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Polymers and scaffolded molecules 

A polymer may be any linear or branched molecule consisting of a plurality of 
5 monomers. The monomers constituting a polymer may be identical, similar or 
different. In one embodiment the polymer may for example be a peptide or a 
polypeptide. If the polymer is a peptide or a polypeptide the monomers are selected 
from the group consisting of a-amino acids. 

1 0 In one embodiment of the invention the template switched product encodes polymer 
comprising or consisting of amino acids. The term "amino acid" should be under- 
stood broadly and includes but is not limited to a-amino acids, p-amino acids, y- 
amino acids and co-amino acids. Amino acids may be in any steric form, for example 
amino acids may be L-amino acids or D-amino acids. In one embodiment of the pre- 

1 5 sent invention the polymer consists of or comprises naturally occurring amino acids. 

Non-limited examples of polymers includes: 

alpha-, beta-, gamma-, and omega-peptides 
20 • mono-, di- and tri-substituted peptides 
L- and D-form peptides 

cyclohexane- and cyclopentane-backbone modified beta-peptides 
vinylogous polypeptides 
glycopolypeptides 
25 • polyamides 

vinylogous sulfonamide peptide 
Polysulfonamide 

conjugated peptide (i.e., having prosthetic groups) 
Polyesters 
30 • Polysaccharides 
Polycarbamates 
Polycarbonates 
Polyureas 
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poly-peptidylphosphonates 
Azatides 

peptoids (oligo N-substituted glycines) 
Polyethers 

ethoxyformacetal oligomers 

poly-thioethers 

polyethylene glycols (PEG) 

Polyethylenes 

Polydisulfides 

polyarylene sulfides 

Polynucleotides 

PNAs 

LNAs 

Morpholinos 

oligo pyrrolinone 

polyoximes 

Polyimines 

Polyethyleneimine 

Poly acetates 

Polystyrenes 

Polyacetylene 

Polyvinyl 

Lipids 

Phospholipids 

Glycolipids 

polycycles (aliphatic) 

polycycles (aromatic) 

polyheterocycles 

Proteoglycan 

Polysiloxanes 

Polyisocyanides 

Polyisocyanates 

Polymethacrylates 
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Non-limiting examples of monomers constituting a polymer according to the inven 
tion includes: 

Hydroxyls 

5 • Primary, secondary, tertiary amines 
• Carboxylic acids 

Phosphates, phosphonates 
Sulfonates, sulfonamides 
Amides 
1 0 • Carbamates 
Carbonates 
Ureas 

Alkanes, Alkenes, Alkynes 
Anhydrides 
1 5 • Ketones 
Aldehydes 
Nitatrates, nitrites 
Imines 

Phenyl and other aromatic groups 
20 • Pyridines, pyrimidines, purines, indole, imidazole, and heterocyclic bases 
Heterocycles 
polycycles 
Flavins 
Halides 

25 • Metals 
Chelates 

Mechanism based inhibitors 
Small molecule catalysts 
Dextrins, saccharides 
30 • Fluorescein, Rhodamine and other fluorophores 
Polyketides, peptides, various polymers 
Enzymes and ribozymes and other biological catalysts 
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Functional groups for post-polymerization/post activation coupling of func- 
tional groups 

Drugs, e.g., taxol moiety, acyclovir moiety, "natural products" 
Supramolecular structures, e.g. nanoclusters 
5 • Lipids 

Oligonucleotides, oligonucleotide analogs (e.g., PNA, LNA, morpholinos) 

In one embodiment of the invention the one or more template switched products 
encodes a scaffolded molecule. A scaffolded molecule according to the present 
10 invention comprises a molecular entity covalently linked to at least 3 functional 

groups. The functional group may be any molecular entity, the identity of which is 
encoded by the template or template switched product. For example any of the 
above mentioned monomers may be a functional group. 

15 Activity of the templates and the template switched products. 

In preferred aspects the present invention relates to methods of providing template 
switched products comprising a predetermined activity. For example the methods 
may be used to prepare template switched products with improved activity, using 
20 templates comprising said activity. 

By way of example, if it is desired to obtain a molecule with improved affinity to a 
specific compound, one or more templates encoding molecules with affinity for said 
specific compound may be used. The template switched products may be analysed 
25 and products encoding molecules with enhanced affinity for said specific compound 
may be selected. 

Hence, in one preferred embodiment the methods according to the invention 
comprises that 

30 

i) the first template comprises a first activity or encodes a molecule 
comprising a first activity; and 
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ii) the second template comprises a second activity or encodes a molecule 
comprising a second activity; and 

iii) the predetermined property of the template switched product is a third 
activity, either comprised within the template switched product or in a 

5 molecule encoded by the template switched product; and 

wherein the first activity is different to the third activity and the second activity is 
different to the third activity. 

10 The activity of the first template, the second template or any other templates or of 
the recombinded products may individually be for example that the template(s) or 
product(s) are capable of associating with a specific binding partner with a specific 
affinity, capable of catalysing a specific reaction, capable of inhibiting a specific 
reaction, capable of effecting a particular cellular response, capable of interacting 

15 with a physical surface or that the template(s) or product(s) encode a molecule 

capable of associating with a specific binding partner with a specific affinity, capable 
of catalysing a specific reaction, capable of inhibiting a specific reaction, capable of 
effecting a particular cellular response or capable of interacting with a physical 
surface. 

20 

Nucleic acid polymerase 

A nucleic acid polymerase is a catalyst, e.g. an enzyme or ribozyme, capable of 
catalysing template dependent polymerisation of nucleotides. Preferred nucleic acid 

25 polymerases according to the present invention includes such nucleic acid 

polymerases that are capable of performing one or more template switches during 
said polymerisation. I.e. preferred nucleic acids polymerases are capable 
of jumping from one template to another during transcription or replication, thereby 
synthesising a product which is templated by two or more different templates. 

30 Preferred nucleic acid polymerases comprises a RNase H activity. 
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The nucleic acid polymerase according to the invention may be a naturally occurring 
nucleic acid polymerase or it may be a nucleic acid polymerase produced by 
recombinant technology. 

5 In a preferred embodiment the nucleic acid polymerase is a reverse transcriptase, 
i.e. the nucleic acid polymerase is capable of catalysing polymerisation of DNA 
using an RNA template. Preferred reverse transcriptases according to the present 
invention includes but is not limited to HIV reverse transcriptase and functional 
homologues thereof, reverse transcriptase from MoMLV (moloney murine leukaemia 
10 virus) and functional homologues thereof, SNV (spleen necrosis virus) reverse 

transcriptase and functional homologues thereof or MLV (murine leukaemia virus) 
reverse transcriptase and functional homologues thereof. 

It is also comprised within the present invention to use a nucleic acid polymerase, 
15 such as a reverse transcriptase, which has been genetically engineered to 
enhanced frequency of template switching. 

Template switch signals 

20 In a preferred embodiment of the invention the methods described herein 

furthermore comprises introducing into said first and/or second template and/or any 
further templates one or more template switch signals prior to synthesis of the 
template switched product. In other embodiments of the invention, one or more 
templates comprises at least one template switch signal, which was inserted during 

25 synthesis of the template(s). 

A template switch signal is any modification of a template, capable of inducing a 
higher rate of template switches of a nucleic acid polymerase 

30 The mechanism of template switch and the factors determining when template 
switch occurs are still debated (Negroni and Buc, Nat Rev Mol Cell Biol, 2001, 2: 
151-155). However, it appears that template switch is promoted when a nucleic acid 
polymerase for example reverse transciptase encounters sequences or structures at 
which continued synthesis is difficult (Wu et at., J.Biol.Chem. 1995, 270, 325-332), 
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although evidence for pause independent template switch has also been presented. 
Below models for template switch are summarised with emphasis on aspects of 
relevance for the presented invention. However, it will be appreciated by the person 
skilled in the art that the present invention is not restricted to any of the below de- 
5 scribed models for template switch. 

Pause-driven template switch: Pause-driven template switch occurs when a nucleic 
acid polymerase encounters obstacles in the template that prevents continued syn- 
thesis of product. For example a reverse transcriptase may encounter obstacles in a 

10 donor template RNA that prevents DNA synthesis. When this occurs it has been 
suggested that the RNase H activity of reverse transcriptase degrades the donor 
template RNA yielding short fragments that dissociate from the nascent DNA strand. 
Acceptor template RNA can then anneal to the liberated DNA strand and DNA syn- 
thesis be resumed. In this model f strand transfer is envisaged as an "enzyme-free" 

1 5 process involving only the nucleic acids. 

One natural obstacle to template dependent nucleotide polymerisation catalysed by 
a nucleic acid polymerase is secondary structure in the template. For example, sec- 
ondary structures in the template RNA may be an obstacle for reverse transcription. 
20 However, the involvement of secondary structures in template switch is not obvious 
as the secondary structure must be absent in the acceptor template RNA. In natural 
templates this may be fulfilled if certain secondary structures are formed only after 
RNase H degradation of 3' sequences, i.e. the corresponding sequences in non 
transcribed RNA template does not contain the secondary structure. 

25 

A more radical obstacle to continued template dependent nucleotide polymerisation 
is when a nick or a breakage is present in the template. For example, DNA synthe- 
sis by reverse transcriptase will be paused or stopped by a nick or breakage in the 
template RNA. Reverse transcriptase is then forced to switch template to continue 
30 DNA synthesis. This situation termed "forced copy choice" closely resembles strong 
stop template switch, which occurs during replication of the retroviral genome. 

Active displacement : In active displacement, the acceptor template displaces the 
donor template without requiring strong pausing of nucleic acid polymerase cata- 
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lysed nucleotide polymerisation. Byway of example an acceptor RNA may displace 
a donor RNA during reverse transcription. The acceptor RNA is thought to hybridize 
to the newly synthesized DNA strand behind reverse transcriptase. This is possible 
because the donor RNA is being degraded by the RNase H activity of reverse tran- 
5 scriptase, hence liberating the trailing DNA for interaction with an acceptor RNA. 
When the acceptor RNA/DNA hybrid catches up, the newly synthesized DNA will 
anneal to acceptor RNA as soon as the corresponding donor RNA has been de- 
graded. Hence the acceptor RNA is docked onto the nascent DNA behind the 
RNase H active site and may efficiently compete with the donor RNA for an interac- 
10 tion with reverse transcriptase/the enzyme. However, whether the exchange re- 
quires the dissociation of the enzyme from the nascent DNA is currently not known. 

A variety of different template switch signals may be employed with the present 
invention. For example the template switch signal may be a breakage in the 
15 template. 

In another example the template switch signal comprises a predetermined 
secondary structure. Said predetermined secondary structure may for example 
comprise a double stranded nucleic acid, such as a hairpin loop of said nucleic acid. 

20 For example the predetermined secondary structure may comprise a double 

stranded RNA or it may comprise a double stranded DNA, or a double stranded 
nucleic acid, wherein one of the strands or both of the strands are selected from the 
group consisting of DNA, RNA, LNA and PNA, including any combination thereof, 
such as e.g. DNA:DNA; RNA: RNA; DNA: RNA; DNA:LNA; DNAPNA; RNA: LNA; 

25 RNA:PNA. 

It is generally known that reverse transcriptase is paused when encountering re- 
gions with strong secondary structure. Therefore template switching may be induced 
with short LNA or PNA oligos hybridized to RNA templates. One advantage of this 
30 method is that it is possible to control the sites of recombination and the frequencies 
of recombination at these sites. Since the frequency of template switching might be 
expected to correlate with the stability of secondary structures encountered, the fre- 
quency of template switching may be controlled by adjusting the length and concen- 
tration of LNA/PNA oligos. 
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In yet another embodiment of the invention the template switch signal comprises a 
nucleotide analogue. When a template comprises a certain kind of nucleotide 
analogues, a nucleic acid polymerase may be paused on said template in the 
5 position of said nucleotide analogue. According to one theory pausing of the nucleic 
acid polymerase may cause a template switch, and hence such a nucleotide 
analogue may serve as a template switch signal. 

For example template RNA may contain modified nucleotides that cannot be reverse 
10 transcribed. When reverse transcriptase encounters such modified nucleotides, 

DNA synthesis can only continue if a template switch occurs. Modified nucleotides 
that may be employed in this approach include abasic nucleotides (sugar phosphate 
without base), universal nucleotides (non hydrogen bonding) and pyrene nucleotides 
(base corresponds to entire Watson Crick base pair) (Kunkel and Bebenek, Annu 
15 Rev Biochem, 2000, 69: 497-529). Modified nucleotides can be incorporated in the 
template RNAs either during transcription or as phosphoramidite analogues during 
chemical synthesis. 

In a preferred embodiment of the invention the template switch signal is a breakage 
20 in the template. Said breakage may be introduced during synthesis of the template. 
Methods of introducing these kind of breakages are discussed herein above. 

However, breakages may also be introduced into the template subsequent to 
synthesis of the template. A number of different methods of introducing breakages 
25 into nucleic acids or nucleic analogues are known to the person skilled in the art. 

For example the method may comprise treatment of the first and/or second template 
with iodine. 

30 Furthermore, breakages may be introduced by limited enzymatic digestion. Said 
enzymatic digestion may be catalysed by any suitable enzyme depending on the 
nature of the tempalte. When the template comprises or consists of RNA, the 
enzymatic digestion may for example be performed by a ribonuclease. 
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The breakage(s) may also be introduced by limited alkaline hydrolysis or by limited 
fragmentation using hydroxy! radicals. 

In one embodiment of the invention, the template is an RNA molecule. Breakages 
5 may be introduced into RNA by fragmentation of said RNA. Fragmentation of RNA 
may be carried out in several ways including limited enzymatic digestion with 
ribonucleases, limited alkaline hydrolysis and limited fragmentation using hydroxy I 
radicals. In another approach phosphorothioate ribonucleotide analogues are 
incorporated into the RNA sequences during in vitro transcription. Treatment of RNA 
10 molecules containing phosphorothioate ribonucleotide analogues with iodine 

specifically cleaves phosphorothioester bonds. Hence the degree of fragmentation 
can be controlled by adjusting the frequency of phosphorothioate ribonucleotide 
analogues to ribonucleotides. 

15 It is within the scope of the present invention that more than one different kind of 
template switch signal may be employed with the methods according to the 
invention. Even within one template, different template switch signals may be 
inserted. 

Template switching 

20 

According to current models for the mechanisms of retroviral recombination, it ap- 
pears plausible that the frequency of template switching may be increased by in- 
creasing the pause frequency of the nucleic acid polymerase. Also increasing hy- 
bridization between nascent DNA and acceptor RNA is expected to increase the 
25 frequency of recombination. 

Hence, depending on the nature of the nucleic acid polymerase, template switching 
may be induced by a number of other methods apart from introduction of template 
switch signals. The methods of inducing template switching may be employed alone 
30 or in combination with one another and/or in combination with introduction of 
template switch signals. 
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Hence in one embodiment of the invention the methods comprises that one or more 
factors capable of affecting frequency and/or degree and/or accuracy of template 
switching are added during synthesis of the template switched products. 

5 Said factor can for example be a protein, such as the nucleocapsid protein (NC 

protein). Furthermore said factor may be a minor groove binding factor or an RNA 
chaperone, such as StpA. 

The conditions under which the synthesis of template switched product is performed 
10 may also influence template switching. The processivity and hence pausing fre- 
quency of for example reverse transcriptase may be altered by adjusting reaction 
buffers and in the particular the concentrations of divalent cations and dNTPs. Also 
the temperature might be changed to decrease processivity. E.g. a temperature shift 
from 37 °C to 0 °C is expected to pause reverse transcription and influence, such as 
1 5 increase the frequency of template switches. 

Hence, in one embodiment of the invention, the synthesis is performed at a 
temperature in the range of 0 to 5°C. such as 5-1 5°C, for example 15-25°C, such 
as 25-50°C, for example 50-70°C. 

20 

In another embodiment the synthesis is performed in the presence of a divalent 
cation at a concentration in the range of from 0.1 mM to about 25 mM, such as from 
0.1 mM to 0.5 mM. for example from 0.5 mM to 1 mM, such as from 1 to 2 mM, for 
example from 2 to 4 mM, such as from 4 to 6 mM. for example from 6 to 7 mM, such 

25 as from 7 to 8 mM, for example from 8 to 9 mM. such as from 9 to 10 mM. for 

example from 10 to 1 1 mM. such as from 1 1 to 12 mM, for example from 12 to 14 
mM, such as from 14 to 16 mM, for example from 16 to 18 mM, such as from 18 to 
20 mM, for example from 20 to 22 mM, such as from 22 to about 25 mM, for 
example from 5 to 7 mM, such as from 5 to 8 mM, for example from 5 to 9 mM, such 

30 as from 5 to 10 mM, for example from 5 to 12 mM. such as from 5 to 14 mM, for 

example from 5 to 16 mM. such as from 5 to 18 mM. for example from 5 to 20 mM, 
such as from 5 to 25 mM, for example from 10 to 12 mM, such as from 10 to 14 mM, 
for example from 10 to 16 mM, such as from 10 to 18 mM, for example from 10 to 
20 mM, such as from 1 0 to 25 mM. 
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In vivo treatment with hydroxyurea (causing depletion of the cellular dNTP pools and 
consequently slowing down DNA polymerization by reverse transcriptase) increases 
template switching (Hwang et al., 2001. Proc. Natl. Acad. Sci M 98, 12209-12214). 
5 Thus, in yet another embodiment the synthesis is performed in the presence of 

dIMTPs at a concentration in the range of from 0.1 to 200 ^M, for example from 0.1 
to 100 nM, such as from 0.5 to 1 nM, for example from 1 to 2 ^M, such as from 2 to 
4 nM, for example from 4 to 6 fiM, such as from 6 to 8 ^M, for example from 8 to 10 
|.iM, such as from 10 to 12 ^M, for example from 12 to 14 ^M, such as from 14 to 16 

10 |iM, for example from 16 to 18 ^iM, such as from 18 to 20 jaM, for example from 20 
to 25 jiM, such as from 25 to 30 nM, for example from 30 to 35 nM, such as from 35 
to 40 nM, for example from 40 to 45 jiM, such as from 45 to 50 ^M, for example 
from 50 to 55 pM, such as from 55 to 60 ^M, for example from 60 to 65 nM, such as 
from 65 to 70 jiM, for example from 70 to 75 (iM, such as from 75 to 80 ^M, for ex- 

1 5 ample from 80 to 85 ^M, such as from 85 to 90 ^M, for example from 90 to 95 jaM, 
such as from 95 to 100 jiM, for example from 1 to 5 such as from 1 to 10 \iM, 
for example from 1 to 15 ^M, such as from 1 to 20 ^iM, for example from 1 to 25 ^iM, 
such as from 1 to 30 ^iM, for example from 1 to 35 ^iM, such as from 1 to 40 nM, for 
example from 1 to 45 |aM, such as from 1 to 50 |*M t for example from 1 to 55 ^M, 

20 such as from 1 to 60 ^M, for example from 1 to 70 ^M, such as from 1 to 75 ^M, for 
example from 1 to 80 \iM t such as from 1 to 85 yM, for example from 1 to 95 ^M. 

Furthermore, it is possible to add dNTP analogues that are active site binders, but 
nonincorporable to the template switched product synthesis reaction, such as a re- 

25 verse transcription reaction. In this approach the reverse transcriptase is paused by 
the binding of a dNTP analogue in its active site that cannot be incorporated. Hence 
the enzyme will pause until the dNTP analogue diffuses out of its active site allowing 
entrance of a natural (incorporate) dNTP is temporarily blocked. In this approach 
pausing frequency is controlled by adjusting the ratios of natural dNTP to dNTP ana- 

30 logues. (Wright and Brown, 1985, Biochem Biophys Res Comm., 126, 109-116). 

Accordingly, template switching and thereby recombination may be increased by a 
number of different methods, examples of which are summarised herein below: 
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A) Fragmentation of RNA 

1) Hydrolysis 

2) Radical fragmentation 

5 3) Endonucleolytic degradation 

4) RNase H degradation mediated by DNA oligos 

5) Iodine cleavage of phosphorothioester bonds in RNA incorporated 
during synthesis (Science 1988 Jun 10;240(4858): 1520-2) 

B) Truncation of RNA 

1) Dideoxy termination during transcription 

2) Fragmentation of DNA template 

i) Radical fragmentation 

ii) Endonucleolytic degradation 

iii) Exonucleolytic degradation 

3) Exonucleolytic truncation 
i) Phosphorothioate stopblocks (no refs.) 

C) Pausing reverse transcriptase 

1) Incorporate modified nucleotide as stop/pause sites (abasic ana- 
logues, universal nucleotides, pyrene nucleotides and other nucleo- 
tides that cannot be read/reverse transcribed) 

2) Hybridize LNA/PNA/morpholino/natural oligos to donor RNA (mim- 
icking secondary structure) 

i) Add oligos and crosslink to donor RNA 

3) Add intercalators 

4) Modify template with RNA modifying agents (DMS, kethoxal, CMCT, 

DEPC) 

30 D) Decreasing reverse transcriptase processivity 

1) Dope dNTP pool with non-incorporable dNTP mimics 

2) Decrease dNTP concentration 

3) Add intercalators 

4) Decrease divalent cation concentration 
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5) Low or high temperature 

E) Increasing hybridisation between nascent DNA and acceptor RNA 

1) Add NC protein (Proc Natl Acad Sci U S A 2000 Jun 6;97(12):6385- 
5 90) 

2) Add other RNA chaperone, e.g. StpA 

3) Add chemicals (e.g. cationic detergents (examples including but not 
limited to : cetyltrimethylammonium bromide (CTAB), dodecyltri 
methylammonium bromide (DTAB), tetramethylammonium chloride 

10 (TMAC), and the like), modulation of ionic strength by modifying 

Na(+) and/or K(+) ion concentration, polyethylene glycol (PEG; final 
concentration of 0.5-20%), dextran sulphate, polyvinylpyrrolidone, 

and the like) 

15 F) Decreasing hybridisation specificity 

1) Increase salt concentration (mono and divalent cations) 

G) Affecting strand transfer efficiency of RT 

1 ) Engineering reverse transcriptase for increased efficiency 
20 2) Add (engineered) NC protein 

3) Increasing RNaseH activity 

H) Addition of RNA oligos (chemically synthesized or transcribed from chemically 
synthesized DNA templates) to enhance crossover at predetermined positions. 

25 

Nucleic acids and nucleic acid analogues 

The following section is directed to nucleic acids, analogues thereof, and templates 
and/or template switched products comprising such nucleic acids. 

30 

Nucleic acids according to the invention comprise a plurality of nucleotides. Nucleic 
acid analogues comprises one or more nucleotide analogues and optionally one or 
more nucleotides. 
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Nucleotides according to the invention includes ribonucleotides comprising a nu- 
cleobase selected from the group consisting of adenine (A), uracil (U), guanine (G), 
and cytosine (C), and deoxyribonucleotide comprising a nucleobase selected from 
5 the group consisting of adenine (A), thymine (T), guanine (G), and cytosine (C). 

Nucleobases are capable of associating specifically with one or more other nucleo- 
bases via hydrogen bonds. Thus it is an important feature of a nucleobase that it 
can only form stable hydrogen bonds with one or a few other nucleobases, but that it 
10 can not form stable hydrogen bonds with most other nucleobases usually including 
itself. The specific interaction of one nucleobase with another nucleobase is gener- 
ally termed "base-pairing". 

The base pairing results in a specific hybridisation between predetermined and 
15 complementary nucleotides. Complementary nucleotides according to the present 
invention are nucleotides that comprise nucleobases that are capable of base- 
pairing. 

Of the naturally occurring nucleobases adenine (A) pairs with thymine (T) or uracil 
20 (U); and guanine (G) pairs with cytosine (C). Accordingly, e.g. a nucleotide 

comprising A is complementary to a nucleotide comprising either T or U, and a 
nucleotide comprising G is complementary to a nucleotide comprising C. 

Nucleotide analogues according to the present invention are capable of specific hy- 
25 bridisation and are capable of associating with the active site of at least one nucleic 
acid polymerase. 

Hence, some nucleotide analogues are capable of being incorporated into a growing 
nucleic acid strand similar to wild type nucleotides, while other nucleotide analogues 
30 are capable of being inserted only at the end of a nucleic acid strand and yet other 
nucleotide analogues are capable of associating with a nucleic acid polymerase, but 
not capable of being incorporated into a nucleic acid strand. 
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For example, the nucleotide analogue may be capable of entering the active site of 
the polymerase, but not capable of being incorporated into a nucleic acid. 
Furthermore, a nucleotide analogue may be capable of being incorporated to the 
end of a nucleic acid molecule, but said incorporation of the nucleotide analogue 
5 inhibits elongation of the nucleic acid. 

In one embodiment of the invention the nucleotide analogue is a phosphorothioate 
ribonucleotide analogue. 

10 In one embodiment nucleotide analogues include any nucleotide analogues capable 
of specific base-pairing, for example derivatives of naturally occurring nucleotides or 
nucleotide analogues wherein the nucleotide backbone differs from naturally occur- 
ring nucleotide backbones. 

15 Specific examples of preferred nucleotide analogues useful in this invention include 
nucleotides analogues containing modified backbones or non-natural intemucleo- 
side linkages. As defined in this specification, nucleotide analogues having modified 
backbones include those that retain a phosphorus atom in the backbone and those 
that do not have a phosphorus atom in the backbone. For the purposes of this speci- 

20 fication, and as sometimes referenced in the art, modified nucleotide analogues that 
do not have a phosphorus atom in their internucleoside backbone can also be con- 
sidered to be nucleosides. 

Preferred modified nucleotide analogue backbones include, for example, phos- 
25 phorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3 - 
alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophos- 
phoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and borano- 
30 phosphates having normal 2-5' linkages, Z-5' linked analogs of these, and those 

having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3'- 
5' to 5*-3' or 2'-5' to 5'-2\ Various salts, mixed salts and free acid forms are also in- 
cluded. 
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Representative United States patents that teach the preparation of the above phos- 
phorus-containing linkages include, but are not limited to U.S. Pat. Nos. 3,687,808; 
4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 
5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 
5,571,799; 5,587,361; and 5,625,050. 

Preferred modified nucleotide analogue backbones that do not include a phosphorus 
atom therein have backbones that are formed by short chain alkyl or cycloalkyl in- 
ternucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside 
linkages, or one or more short chain heteroatomic or heterocyclic internucleoside 
linkages. These include those having morpholino linkages (formed in part from the 
sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone 
backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and 
thioformacetyl backbones; alkene containing backbones; sulfamate backbones; me- 
thyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide back- 
bones; amide backbones; and others having mixed N, O, S and CH 2 component 
parts. 

Representative United States patents that teach the preparation of the above nu- 
cleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 
5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 
5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 
5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 
5,663,312; 5,633,360; 5,677,437; and 5,677,439. 

In other preferred nucleotide analogues, both the sugar and the internucleoside link- 
age, i.e., the backbone, of the nucleotide units are replaced with novel groups. The 
base units are maintained for hybridization with an appropriate nucleic acid target 
30 compound. One such nucleotide analogue that has been shown to have excellent 
hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA com- 
pounds, the sugar-backbone of an oligonucleotide is replaced with an amide con- 
taining backbone, in particular an aminoethylglycine backbone. The nucleobases 
are retained and are bound directly or indirectly to aza nitrogen atoms of the amide 
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portion of the backbone. Representative United States patents that teach the prepa- 
ration of PNA compounds include, but are not limited to, U.S. Pat. Nos.: 5,539,082; 
5,714,331; and 5,719,262. Further teaching of PNA compounds can be found in 
Nielsen et al. (Science, 1991, 254, 1497-1500). 

5 

Most preferred embodiments of the invention are nucleotide analogues with phos- 
phorothioate backbones and nucleosides with heteroatom backbones, and in par- 
ticular -CH 2 -NH--0--CH 2 --, -CH 2 -N(CH 3 )-0~CH 2 - [known as a methylene 
(methylimino) or MMI backbone], ~CH 2 0— N(CH 3 )--CH 2 ~, --CH 2 ~N(CH 3 )~N(CH 3 )-- 
10 CH 2 - and --0~N(CH 3 )--CH 2 --CH 2 -- [wherein the native phosphodiester backbone is 
represented as ~0-P~0-CH 2 -) of the above referenced U.S. Pat. No. 5,489,677, 
and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also 
preferred are nucleotide analogues having morpholino backbone structures of the 
above-referenced U.S. Pat. No. 5,034,506. 

15 

Modified nucleotide analogues may also contain one or more substituted sugar 
moieties. Preferred nucleotide analogues comprise one of the following at the 2' 
position: OH; F; O--, S~, or N-alkyl, O-alkyl-O-alkyl, O--, S-- f or N-aikenyl, or O--, S- 
or N-alkynyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubsti- 
20 tuted C 4 to C 10 alkyl or C 2 to C 10 alkenyl and alkynyl. 

Particularly preferred are 0[(CH 2 ) n O] m CH 3 , 0(CH 2 ) n OCH 3 , 0(CH 2 ) 2 ON(CH 3 ) 2 , 
0(CH 2 ) n NH 2 , 0(CH 2 ) n CH 3 , 0(CH 2 ) n ONH 2t and 0(CH 2 ) n ON[(CH 2 ) n CH 3 )] 2 , where n 
and m are from 1 to about 10. 

25 

Other preferred nucleotide analogues comprise one of the following at the 2' posi- 
tion: C* to C10 lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O- 
aralkyl, SH, SCH 3 , OCN, CI, Br, CN, CF 3 , OCF 3 , SOCH 3 , S0 2 , CH 3 , ON0 2 , N0 2l N 3 , 
NH 2 , heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substi- 
30 tuted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for im- 
proving the pharmacokinetic properties of an oligonucleotide, or a group for improv- 
ing the pharmacodynamic properties of an oligonucleotide, and other substituents 
having similar properties. A preferred modification includes 2'-methoxyethoxy 
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(2'-0--CH 2 CH 2 OCH 3 , also known as 2'-0-(2-methoxyethyl) or 2'-MOE) (Martin et 
al., Helv. Chim. Acta 1995, 78, 486-504) i.e.. an alkoxyalkoxy group. 

Other preferred modifications include Z-methoxy (2'-0--CH 3 ), Z-aminopropoxy (2- 
5 OCH 2 CH 2 CH 2 NH 2 ) and 2'-fluoro (2'-F). Similar modifications may also be made at 
other positions on the nucleotide, particularly the 3* position of the sugar on the 3' 
terminal nucleotide or in 2*-5' linked nucleotides and the 5* position of 5' terminal 
nucleotide. Nucleotide analogues may also have sugar mimetics such as cyclobutyl 
moieties in place of the pentofuranosyl sugar. Representative United States patents 
1 0 that teach the preparation of such modified sugars structures include, but are not 
limited to. U.S. Pat. Nos. 4.981.957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 
5,446.137; 5,466,786; 5,514.785; 5,519,134; 5,567,811; 5,576,427; 5,591.722; 
5,597,909; 5,610,300; 5.627,053; 5.639,873; 5,646,265; 5,658,873; 5,670,633; and 
5,700,920. 

15 

Nucleotide analogues may also include nucleobase (often referred to in the art 
simply as "base") modifications or substitutions. As used herein, "unmodified" or 
"natural" nucleobases include the purine bases adenine (A) and guanine (G), and 
the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases 
20 include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C 
or m5c), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6- 
methyl and other alkyl derivatives of adenine and guanine. 2-propyl and other alkyl 
derivatives of adenine and guanine. 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5- 
halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and 
25 thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8- 
hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- 
bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines. 7- 
methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7- 
deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. 
30 Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those 

disclosed in the Concise Encyclopedia Of Polymer Science And Engineering 1990, 
pages 858-859. Kroschwitz, J. I., ed. John Wiley & Sons, those disclosed by 
Englisch et al. (Angewandte Chemie, International Edition 1991. 30, 613-722), and 
those disclosed by Sanghvi, Y. S.. Crooke, S. T. and Lebleu, B„ eds., Antisense 
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Research and Applications 1993, CRC Press, Boca Raton, pages 289-302. Certain 
of these nucleobases are particularly useful for increasing the binding affinity of the 
oligorneric compounds of the invention. These include 5-substituted pyrimidines, 6- 
azapyrimidines and N-2, N-6 and 0-6 substituted purines, including 2- 
5 aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-Methylcytosine 
substitutions have been shown to increase nucleic acid duplex stability by 0.6- 
1.2. degree. C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B. f eds., Antisense 
Research and Applications 1993, CRC Press, Boca Raton, pages 276-278) and are 
presently preferred base substitutions, even more particularly when combined with 
1 0 2'-0-methoxyethyl sugar modifications. 

Representative United States patents that teach the preparation of certain of the 
above noted modified nucleobases as well as other modified nucleobases include, 
but are not limited to, the above noted U.S. Pat. Nos. 3,687,808, as well as U.S. Pat. 
15 No. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 
5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 
5,596,091; 5,614,617; and 5,681,941. 

Another modification of the nucleotide analogues of the invention involves 
20 chemically linking to the nucleotide one or more moieties or conjugates which 

enhance the activity, cellular distribution or cellular uptake of the nucleotide. Such 
moieties include but are not limited to lipid moieties such as a cholesterol moiety 
(Letsinger et al., Proc. Natl. Acad. Sci. USA 1989, 86, 6553-6556), cholic acid 
(Manoharan et al., Bioorg. Med. Chem. Lett. 1994, 4, 1053-1059), a thioether, e.g., 
25 hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci. 1992, 660, 306-309; 

Manoharan et al., Bioorg. Med. Chem. Let. 1993, 3, 2765-2770), a thiocholesterol 
(Oberhauser et al., Nucl. Acids Res. 1992, 20, 533-538), an aliphatic chain, e.g., 
dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J. 1991, 10, 
1111-1118; Kabanov et al., FEBS Lett. 1990, 259, 327-330; Svinarchuk et al., 
30 Biochimie 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or 

triethylammonium 1,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et 
al., Tetrahedron Lett. 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res. 1990, 18, 
3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., 
Nucleosides & Nucleotides 1995, 14, 969-973), or adamantane acetic acid 
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(Manoharan et aL, Tetrahedron Lett. 1995, 36, 3651-3654), a palmityl moiety 
(Mishra et al., Biochim, Biophys. Acta 1995, 1264, 229-237), or an octadecylamine 
or hexylarnino-carbonyl-oxycholesterol moiety (Crooke et aL, J. Pharmacol. Exp. 
Ther. 1996, 277, 923-937). 

Representative United States patents that teach the preparation of such 
oligonucleotide analogue conjugates include, but are not limited to, U.S. Pat. Nos. 
4,828,979; 4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 
5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 
5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 
4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 
4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 
5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 
5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 
5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 
5,599,923; 5,599,928 and 5,688,941. 

The present invention also includes nucleic acid analogues which are chimeric 
nucleic acid analogues. "Chimeric" nucleic acid analogues or "chimeras," in the 
20 context of this invention, are nucleic acid analogues which contain two or more 

chemically distinct regions, each made up of at least one nucleotide. These nucleic 
acid analogues typically contain at least one region wherein the nucleic acid 
analogues is modified so as to confer upon the nucleic acid analogues increased 
resistance to nuclease degradation, increased cellular uptake, and/or increased 
25 binding affinity for the target nucleic acid. An additional region of the nucleic acid 

analogues may serve as a substrate for enzymes capable of cleaving RNA:DNA or 
RNA:RNA hybrids. By way of example, RNase H is a cellular endonuclease which 
cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, 
results in cleavage of the RNA target, thereby greatly enhancing the efficiency of 
30 antisense inhibition of gene expression. Cleavage of the RNA target can be routinely 
detected by gel electrophoresis and, if necessary, associated nucleic acid 
hybridization techniques known in the art. This RNAse H-mediated cleavage of the 
RNA target is distinct from the use of ribozymes to cleave nucleic acids. Ribozymes 
are not comprehended by the present invention. 



10 
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Examples of chimeric nucleic acid include but are not limited to "gapmers," in which 
three distinct regions are present, normally with a central region flanked by two 
regions which are chemically equivalent to each other but distinct from the gap. A 
5 preferred example of a gapmer is an oligonucleotide in which a central portion (the 
"gap") of the oligonucleotide serves as a substrate for RNase H and is preferably 
composed of 2'-deoxynucleotides, while the flanking portions (the 5' and 3' "wings") 
are modified to have greater affinity for the target RNA molecule but are unable to 
support nuclease activity (e.g., fluoro- or 2 , -Omethoxyethyl-substituted). Chimeric 

10 nucleic acid are not limited to those with modifications on the sugar, but may also 

include nucleic acid with modified backbones, e.g., with regions of phosphorothioate 
(P.dbd.S) and phosphodiester (P.dbd.O) backbone linkages or with regions of MMI 
and P.dbd.S backbone linkages. Other chimeras include "wingmers," also known in 
the art as "hemimers," that is, nucleic acid with two distinct regions. In a preferred 

15 example of a wingmer, the 5' portion of the oligonucleotide serves as a substrate for 
RNase H and is preferably composed of 2-deoxynucleotides, whereas the 3' portion 
is modified in such a fashion so as to have greater affinity for the target RNA 
molecule but is unable to support nuclease activity (e.g., 2'-fluoro- or 2'-0- 
methoxyethyl-substituted), or vice-versa. In one embodiment, the nucleic acid of the 

20 present invention contain a 2*-0-methoxyethyl (2'-0--CH 2 CH 2 OCH) 3 modification on 
the sugar moiety of at least one nucleotide. This modification has been shown to 
increase both affinity of the nucleic acid analogues for its target and nuclease 
resistance of the nucleic acid analogues. According to the invention, one, a plurality, 
or all of the nucleotide and/or nucleotide analogue subunits of the nucleic acid 

25 analogues of the invention may bear a 2'-0-methoxyethyl (--0--CH 2 CH 2 OCH 3 ) 
modification, nucleic acid analogues comprising a plurality of nucleotide and/or 
nucleotide analogue subunits having a 2'-0-methoxyethyl modification can have 
such a modification on any of the nucleotide analogue subunits within the nucleic 
acid analogue, and may be chimeric nucleic acids. Aside from or in addition to 2'-0- 

30 methoxyethyl modifications, nucleic acid analogues containing other modifications 
which enhance antisense efficacy, potency or target affinity are also preferred. 

The nucleic acids or nucleic acid analogues used in accordance with this invention 
may be conveniently and routinely made through the well-known technique of solid 
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phase synthesis. Equipment for such synthesis is sold by several vendors including 
Applied Biosystems. Any other means for such synthesis may also be employed; the 
actual synthesis of the nucleic acids is well within the talents of the routineer. It is 
well known to use similar techniques to prepare nucleic acid analogues such as the 
5 phosphorothioates and Z-alkoxy or 2'-alkoxyalkoxy derivatives, including 2'-0- 

methoxyethyl oligonucleotides (Martin, P., Helv. Chim. Acta 1995, 78, 486-504). It is 
also well known to use similar techniques and commercially available modified 
amidites and controlled-pore glass (CPG) products such as biotin, fluorescein, 
acridine or psoralen-modified amidites and/or CPG (available from Glen Research, 
10 Sterling, Va.) to synthesize fluorescently labeled, biotinylated or other conjugated 
oligonucleotide analogues. 



Functional homolooues 

15 

Functional homologues of polypeptides or proteins obtained from template switched 
products generated according to the present invention comprise any polypeptide or 
protein sequence which is capable of performing essentially the same function. By 
way of example, a funtional homologue of HIV reverse transcriptase is capable of 
20 essentially performing the function of HIV reverse transcriptase, i.e. catalysing syn- 
thesis a DNA molecule using an RNA template. 

Functional homologues as e.g. described herein below can be encoded by func- 
tional homologue template switched products generated by template switching 
25 methods of the present invention employing functional homologues of templates, i.e. 
templates the incorporation of which into a template switch product results in said 
template switched product encoding said functional homologue. 

Functional homologues according to the present invention comprise polypeptides 
30 with an amino acid sequence, which are sharing at least some homology with the 
predetermined polypeptide sequences as outlined herein above. For example such 
polypeptides are at least about 40 percent, such as at least about 50 percent ho- 
mologous, for example at least about 60 percent homologous, such as at least 
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about 70 percent homologous, for example at least about 75 percent homologous, 
such as at least about 80 percent homologous, for example at least about 85 per- 
cent homologous, such as at least about 90 percent homologous, for example at 
least 92 percent homologous, such as at least 94 percent homologous, for example 
5 at least 95 percent homologous, such as at least 96 percent homologous, for exam- 
ple at least 97 percent homologous, such as at least 98 percent homologous, for 
example at least 99 percent homologous with the predetermined polypeptide se- 
quences as outlined herein above. 

10 Homology may preferably be calculated by any suitable algorithm or by computer- 
ised implementations of such algorithms for example CLUSTAL in the PC/Gene 
program by Intelligenetics or GAP, BESTFIT, BLAST, FASTA and TFASTA in the 
Wisconsin Genetics Software Package, Genetics Computer Group (GCG). The ho- 
mology between amino acid sequences may furthermore be calculated with the aid 

15 of well known matrices such as for example any one of BLOSUM 30, BLOSUM 40, 
BLOSUM 45, BLOSUM 50, BLOSUM 55, BLOSUM 60, BLOSUM 62, BLOSUM 65, 
BLOSUM 70, BLOSUM 75, BLOSUM 80, BLOSUM 85, and BLOSUM 90. 

Functional homologues may comprise an amino acid sequence that comprises at 
20 least one substitution of one amino acid for any other amino acid. For example such 
a substitution may be a conservative amino acid substitution or it may be a non- 
conservative substitution. 

A conservative amino acid substitution is a substitution of one amino acid within a 
25 predetermined group of amino acids for another amino acid within the same group, 
wherein the amino acids within a predetermined groups exhibit similar or substan- 
tially similar characteristics. Within the meaning of the term "conservative amino acid 
substitution" as applied herein, one amino acid may be substituted for another within 
groups of amino acids characterised by having 

30 

i) polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gin, Ser, Thr, Tyr, and Cys,) 



ii) non-polar side chains (Gly, Ala, Val, Leu, He, Phe, Trp, Pro, and Met) 
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iii) aliphatic side chains (Gly, Ala Val, Leu, lie) 

iv) cyclic side chains (Phe, Tyr, Trp, His, Pro) 
5 v) aromatic side chains (Phe, Tyr, Trp) 

vi) acidic side chains (Asp, Glu) 

vii) basic side chains (Lys, Arg, His) 

0 

viii) amide side chains (Asn, Gin) 

ix) hydroxy side chains (Ser, Thr) 

5 x) sulphor-containing side chains (Cys, Met), and 

xi) amino acids being monoamino-dicarboxylic acids or monoamino- 
monocarboxylic-monoamidocarboxylic acids (Asp, Glu, Asn, Gin). 

20 Functional homologues according to the present invention may comprise more than 
one such substitution, such as e.g. two amino acid substitutions, for example three 
or four amino acid substitutions, such as five or six amino acid substitutions, for ex- 
ample seven or eight amino acid substitutions, such as from 10 to 15 amino acid 
substitutions, for example from 15 to 25 amino acid substitution, such as from 25 to 

25 30 amino acid substitutions, for example from 30 to 40 amino acid substitution, such 
as from 40 to 50 amino acid substitutions, for example from 50 to 75 amino acid 
substitution, such as from 75 to 100 amino acid substitutions, for example more than 
100 amino acid substitutions. 

30 The addition or deletion of an amino acid may be an addition or deletion of from 2 to 
5 amino acids, such as from 5 to 10 amino acids, for example from 10 to 20 amino 
acids, such as from 20 to 50 amino acids. However, additions or deletions of more 
than 50 amino acids, such as additions from 50 to 200 amino acids, are also com- 
prised within the present invention. 
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Additional factors may be taken into consideration when determining functional 
homologues according to the meaning used herein. For example functional homo- 
logues may be capable of associating with antisera which are specific for the poly- 
5 peptides according to the present invention. 

Peptides with N-terminal alkylations and C-terminal esterifications are also encom- 
passed within the present invention. Functional equivalents also comprise glycosy- 
lated and covalent or aggregative conjugates, including dimers or unrelated chemi- 
10 cal moieties. Such functional equivalents are prepared by linkage of functionalities to 
groups which are found in fragment including at any one or both of the N- and C- 
termini, by means known in the art. 

Functional equivalents may thus comprise fragments conjugated to aliphatic or acyl 
1 5 esters or amides of the carboxyl terminus, alkylamines or residues containing car- 
boxyl side chains, e.g., conjugates to alkylamines at aspartic acid residues; O-acyl 
derivatives of hydroxyl group-containing residues and N-acyl derivatives of the 
amino terminal amino acid or amino-group containing residues, e.g. conjugates with 
Met-Leu-Phe. Derivatives of the acyl groups are selected from the group of aikyl- 
20 moieties (including C3 to C10 normal alkyl), thereby forming alkanoyl species, and 
carbocyclic or heterocyclic compounds, thereby forming aroyl species. The reactive 
groups preferably are difunctional compounds known per se for use in cross-linking 
proteins to insoluble matrices through reactive side groups. 

25 

Examples 

The following examples illustrates specific embodiments of the invention and should 
not be regarded as limiting to the invention. 

30 

Example 1 



Four polynucleotide sequences are shuffled and subsequently selected on basis of 
binding to a DNA affinity column. 
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DNA oliqos 

DNA template oligos: 

5 T7pro GACCT CGATC GATCG ATCCT TTAGC CCTAT ATTTA GCTAA TCCGA 

ATGGG CAGGA CTTAT ATGGC AGTAG TCATC GCATC-3' 

T7pro GACCT CGATC GATCG ATCCT TTAGC CCTAT ATTTA GCTAA CCAGC 
GCTTA CAGGA CTTAT ATGGC AGTAG TCATC GCATC-3' 

T7pro GACCT CGATC GATCG ATCCT TTAGC CCTAT ATTTA ATCGA TCCGA 
10 GCTTA CAGGA CTTAT ATGGC AGTAG TCATC GCATC-3' 

T7pro GACCT CGATC GATCG ATCCT TTAGC CCTAT GTCTG GCTA A TCCGA 
GCTTA CAGGA CTTAT ATGGC AGTAG TCATC GCATC-3' 

T7pro * T7 RNAP promoter: S'TAATA CGACT CACTA TAG 

15 

Selector oligo: 

5' -biotin-GTCTG ATCGA CCAGC ATGGG- 3' 
DNA primer 1. 

20 5' -TAATA CGACT CACTA TAG GACCT CGATC GATCG ATC-3' 

DNA primer 2: 

5'-GATGC GATGA CTACT GCC-3' 



25 Preparation of RNA templates 

RNA templates are prepared by T7 RNA polymerase directed in vitro transcription. 
Double stranded DNA templates for T7 transcription are prepared by PCR with DNA 
primer 1 + DNA primer 2 utilizing the four DNA template oligos as PCR templates. 
30 The transcription reaction is performed using standard protocols, e.g. in 40 mM Tris- 
HCI pH 8.1, 1 mM spermidine, 0.01% Triton X-100, 5 mM Dithiothreitol, 5 mM 
MgCI 2 . 10 mM NaCI, 400 pM each NTP. template DNA and T7 RNA polymerase. 
The reaction is normally performed at 37°C. After synthesis, DNA templates are di- 
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gested with RNase free DNase I and synthesized RNA may be either gelpurified, 
gelfiltrated or phenol-extracted and ethanol precipitated. 

Shuffling of parental RNAs 

5 

To enhance template switching the RNA templates from above can be fragmented 
or other template switch signals may be introduced. Below are described two meth- 
ods for fragmentation and one method for introducing template switch signals. 

10 A) Fragmentation: 

i) Fragmentation by alkaline hydrolysis: RNA samples are added NaOH to 
0.5 M and incubated at 50°C for 1 min, whereafter alkaline hydrolysis is 
stopped by addition of 14 volumes 1M HCI. After neutralization, samples 
are gelfiltrated. 

15 ii) Phosphorothioate incorporation and iodoethanol cleavage: Phos- 

phorothioate ribonucleotide analogues are incorporated into RNA tem- 
plates during transcription. In the described example, a desired fre- 
quency of phosphorothioate analogue to natural nucleotides may be 
0.05, subsequently giving rise to approximately 1 fragmentation per 20 

20 nt. To achieve this frequency, natural ribonucleotides are used at 1 mM 

and phosphorthioate analogues at 0.05 mM (Ryder and Strobel, Methods 
1999 May; 18(1):38-50). After transcription, reactions are treated with 
RNase free DNase I, phenol-extracted and ethanol-precipitated. Precipi- 
tated RNAs are resuspended in water and fragmented by iodoethanol, 

25 e.g. by adding a !4 volume of 10 mM Na 2 EDTA f 95 % formamide, 7 % 

(v/v) iodoethanol followed by heating to 95°C for 3 minutes (Gish and 
Eckstein, Science 1988 Jun 10; 240(4858): 1520-2). After fragmentation, 
formamide and iodoethanol is removed by gelfiltration to allow subse- 
quent reverse transcription of fragmented RNAs 

30 

The degree of fragmentation by the various methods is easily analysed by using 
endlabelled RNA that are resolved on sequencing gels after fragmentation. RNA is 



WO 2004/009814 



PCT/DK2003/000490 



56 

endlabeled using T4 polynucleotide kinase and [y- 32 P] ATP using standard proto- 
cols. 



B) Introduction of template switch signals: 
5 Instead of forcing reverse transcriptase to pause by fragmentation of template 
RNAs, pausing may be induced by the provoked formation of strong secondary 
structures in RNA templates. This can be achieved by annealing oligos into template 
RNAs. Oligos may be composed RNA, DNA, PNA, LNA or other nucleotide ana- 
logues, LNA has a very high binding affinity for RNA and may therefore be pre- 

10 ferred. The length, sequence and amount of LNA oligomers can be adjusted to suit 
any particular needs. In this example, LNA oligomers of the length 6 are used at a 1- 
1 ratio with RNA templates. The sequences of LNA oligos are designed to cover the 
entire region in which recombination is desired, as illustrated below. LNA oligos are 
mixed with template RNA and DNA prirner-2 in RT-buffer and annealed by slow 

15 cooling from 65 °C. 



RNA templates: 

G GACCU CGAUC GAUCG AUCCU UUAGC CCUAO AUUUA GCUAA UCCGA AUGGG CAGGA CUUAU AUGGC 
AGUAG UCAUC GCAUC-3' 

20 G GACCU CGAUC GAUCG AUCCU UUAGC CCUAU AUUUA GCUAA CCAGC GCUUA CAGGA CUUAU AUGGC 

AGUAG UCAUC GCAUC-3' 

G GACCU CGAUC GAUCG AUCCU UUAGC CCUAU AUUUA AUCGA UCCGA GCUUA CAGGA CUUAU AUGGC 
AGUAG UCAUC GCAUC-3' 

G GACCU CGAUC GAUCG AUCCU UUAGC CCUAU GUCUG GCUAA UCCGA GCUUA CAGGA CUUAU AUGGC 
25 AGUAG UCAUC GCAUC-3' 



LNA oligos (3' to 5' ) : 

TAGGA A 
AGGA AA 

30 GGA AAT 

GA AATC 
A AATCG 
AATCG G 
ATCG GG 

35 TCG GGA 

CG GGAT 
G GGATA 
GGATA T 
GAT A TA 

40 ATA TAA 

TA TAAA 
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A TAAAT 
TAAAT C 
AAAT CG 
AAT CGA 

5 AT CGAT 

T CGATT 
CGATT A 
GATT AG 
ATT AGG 

10 TT AGGC 

T AGGCT 
AGGCT C 
GGCT CG 
GCT CGA 

15 CT CGAA 

T CGAAT 
CGAAT G 
GAAT GT 
AAT GTC 

20 AT GTCC 

T GTCCT 
GTCCT G 
TCCT GA 
CCT GAA 

25 CT GAAT 

T GAAT A 
GAATA T 
ATTA TA 



30 



Reverse transcription 

DNA primer-2 (and optionally LNA oligos) is annealed to the template RNAs by mix- 
ing and slow cooling from 65 °C in RT buffer (50 mM Tris-HCI, pH 8.0, 75 mM KCI, 
35 0.1 mM EDTA, 1 mM dithiothreitol, and 0.1 % Triton X-100), whereafter reverse 

transcription and hence strand transfer is initated by addition of 100 pM each dNTP, 
7 mM MgCI 2 and HIV-1 reverse transcriptase. The reaction is carried out at 37°C. 
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Separation of cDNAs with a predetermined property 

After reverse transcription, the recombined cDNA is selected on basis of binding to a 
selector oligo immobilized via biotin on streptavidin beads (agarose, sepharose, 
magnetic), or in streptavidin coated tubes. 

DNase free RNase I is added to RT reactions to remove RNA templates, whereafter 
cDNA is gel purified. Streptavidin magnetic beads are prepared by equilibration in 
binding buffer (e.g. 50 mM K-Hepes pH 7.0, 100 mM KCI) and attachment of the 
selector oligo via its biotin moiety to streptavidin. cDNAs are then added and the 
binding reactions are incubated at 25 °C 20 min. Hereafter the binding reactions are 
washed several times with binding buffer; thus cDNAs are selected on basis of their 
ability to hybridise to the selector oligo. 

The selected sequences may be amplified by PCR using streptavidin magnetic 
15 beads with selected cDNAs directly as templates in the PCR. Alternatively, selected 
cDNAs may be eluted with 200 mM KCI, 100 mM NaOH pH 13. After elution, the pH 
is adjusted with addition of 1/10 volume 1M HCI. If the selected sequences are cap- 
tured on the selector oligo immobilized in streptavidin coated PCR tubes, PCR can 
be performed directly without further handling of the sample. 

20 

PCR amplification is carried out with primer-1 and primer-2 using standard proce- 
dures. PCR products are then purified and may serve as templates in another round 
of RNA synthesis, recombination and selection or they may be sequenced for analy- 
sis. 



5 



10 
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Claims 



1 . A method for preparing a template switched product encoded by at least part of 
5 one first template and at least part of at least one second template, wherein said 

product comprises at least one predetermined property, said method comprising 
the steps of 

i) providing a first template molecule and at least one second template 
10 molecule; and 

ii) providing a nucleic acid polymerase; and 

iii) synthesising a plurality of different template switched products by 

1 5 contacting sequentially in any order, or simultaneously, at least part of the 

first template and at least part of the at least one second template with 
said polymerase under conditions allowing for template dependent 
nucleotide polymerisation, 

20 wherein the synthesis of each individual template switched product 

involves at least one template switch, 

and wherein the synthesis of the plurality of different template switched 
products involves a plurality of template switches, 

25 

iv) separating at least one template switched product comprising the at least 
one predetermined property from said plurality of template switched 
products; and 

30 v) obtaining a template switched product comprising at least one 

predetermined property 



2. The method of claim 1 , wherein 
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i) the first template comprises a first activity or encodes a molecule 
comprising a first activity; and 

ii) the second template comprises a second activity or encodes a molecule 
comprising a second activity; and 

iii) the predetermined property of the template switched product is a third 
activity, either comprised within the template switched product or in a 
molecule encoded by the template switched product; and 

wherein the first and second activities are both different from the third activity. 



10 



3. The method of any of claims 1 and 2, wherein the method involves switches 
between more than 2 templates, such as 3, for example 4, such as 5, for 
example in the range from 5 to 10, such as from 10 to 20, for example from 20 to 
50, such as from 50 to 100, for example from 100 to 200, such as more than 200 

15 templates. 

4. The method of claim 3, wherein the method involves more than 2 different 
template switches, such as 3, for example 4, such as 5, for example in the range 
from 5 to 10, such as from 10 to 20, for example from 20 to 50, such as from 50 

20 to 100, for example from 100 to 200, such as more than 200 different template 

switches. 

5. The method of claim 3, wherein the number of different template switched 
products generated is more than 20, for example more than 50, such as more 

25 than 500, for example more than 1000, such as more than 10000, for example 

more than 100000, such as more than 1000000. 

6. The method of claim 3, wherein the number of different template switched 
products generated is in the range of from 1 to 1000000, such as from 1 to 

30 500000, for example from 5 to 500000, such as from 5 to 250000, for example 

from 10 to 200000, such as from 10 to 150000, for example from 15 to 100000, 
such as from 5 to 250000, for example from 10 to 200000, such as from 10 to 
150000, for example from 15 to 100000, such as from 20 to 200000, for example 
from 20 to 150000, such as from 25 to 150000, for example from 25 to 125000, 



WO 2004/009814 



PCTYDK2003/000490 



61 

such as from 25 to 100000, for example from 50 to 100000, such as from 50 to 
50000, for example from 100 to 50000. 

7. The method of any of claims 1 to 6, wherein the nucleic acid polymerase 
5 comprises a RNase H activity. 

8. The method of any of claims 1 to 7, wherein said method involves providing in a 
single reaction mixture more than 10 different templates and obtaining more 
than 100 different template switched products. 

10 

9. The method of any of claims 1 to 8, wherein the polymerase comprises reverse 
transcriptase activity. 

10. The method of claim 1, wherein the method does not involve annealing between 
1 5 the first template and the template switched product. 

1 1 . The method of claim 1 , wherein the method does not involve annealing between 
the second template and the template switched product. 

20 12. The method of claim 1, wherein the method does involve annealing between the 
first template and the template switched product. 

1 3. The method of claim 1 , wherein the method does involve annealing between the 
second template and the template switched product. 

25 

14. The method of claim 1, wherein the first template and/or the second template 
are nucleic acid molecules. 

15. The method of claim 14, wherein the first template and/or the second template 
30 encodes a polymer. 

16. The method of claim 14, wherein the first template and/or the second template 
encodes a polypeptide. 



WO 2004/009814 PCT/DK2003/000490 



62 

17. The method of claim 15, wherein the first template and/or the second template 
encodes a scaffolded molecule. 

18. The method of claim 1, wherein the template switched product is a nucleic acid. 

5 

19. The method of claim 18, wherein the template switched product encodes a 
polymer. 

20. The method of claim 18, wherein the template switched product encodes a 
10 polypeptide. 

21. The method of claim 18, wherein the template switched product encodes a 
scaffolded molecule. 



1 5 22. The method of claim 1 , wherein the first template and/or the second template 
are RNA molecules. 

23. The method of claim 22, wherein the RNA molecule has been synthesised by an 
in vitro transcription reaction. 

20 

24. The method of claim 22, wherein the RNA molecule has been chemically 
synthesised. 

25. The method of claim 22, wherein the RNA molecule has been purified from a cell 
25 extract. 

26. The method of claim 1, wherein the method furthermore comprises introducing 
into said first and/or second template one or more template switch signals prior 
to synthesis of the template switched product. 

30 

27. The method of claim 26, wherein the template switch signal is a breakage in the 
template. 
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28. The method of claim 26, wherein the template switch signal comprises a 
predetermined secondary structure. 

29. The method of claim 28, wherein the predetermined secondary structure 
5 comprises a double stranded nucleic acid . 

30. The method of claim 28, wherein the predetermined secondary structure 
comprises a double stranded RNA. 

10 31 . The method of claim 28, wherein the predetermined secondary structure 
comprises a double stranded DNA 

32. The method of claim 28, wherein the predetermined secondary structure 
comprises a double stranded nucleic acid, wherein one of the strand is selected 

1 5 from the group consisting of LNA and PNA. 

33. The method of claim 26, wherein the template switch signal comprises a 
nucleotide analogue. 

20 34. The method of claim 33, wherein the nucleotide analogue is capable of entering 
the active site of the polymerase, but not capable of being incorporated into a 
nucleic acid. 

35. The method of claim 33, wherein the nucleotide analogue is capable of being 
25 incorporated to the end of a nucleic acid molecule, and wherein incorporation of 

the nucleotide analogue inhibits elongation of the nucleic acid. 

36. The method of claim 33, wherein the nucleotide analogue is a phosphorothioate 
ribonucleotide analogue. 



30 



37. The method of claim 36, wherein the method comprises treatment of the first 
and/or second template with iodine. 
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38. The method of claim 27, wherein the nick(s) are introduced by limited enzymatic 
digestion. 

39. The method of claim 38, wherein the enzymatic digestion is performed by a 
5 ribonuclease. 

40. The method of claim 27, wherein the breakage is introduced by limited alkaline 
hydrolysis. 

10 41 . The method of claim 27, wherein the breakage is introduced by limited 
fragmentation using hydroxyl radicals. 

42. The method of claim 1 , wherein the synthesis comprises addition of one or more 
factors capable of affecting frequency and/or degree and/or accuracy of 

15 template switching. 

43. The method of claim 42, wherein said factor is selected from the group 
consisting of DMS, kethoxal, CMCT, and DEPC. 

20 44. The method of claim 42, wherein said factor is selected from the group 

consisting of a cationic detergent such as cetyitrimethylammonium bromide 
(CTAB), dodecyltrimethylammonium bromide (DTAB), tetramethylammonium 
chloride (TMAC), ions such as Na(+) and/or K(+) ions, and polymers such as 
polyethylene glycol (PEG), dextran sulphate, and polyvinylpyrrolidone. 

25 

45. The method of claim 42, wherein the factor is a protein. 

46. The method of claim 45, wherein the protein is the nucleocapsid (NC) protein. 



30 47. 



The method of claim 1 , wherein the synthesis is performed at a temperature in 
the range of 0 to 5°C, such as 5-1 5°C, for example 15-25°C, such as 25-50°C, 
for example 50-70°C. 
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48. The method of claim 1 , wherein the synthesis is performed in the presence of a 
divalent cation at a concentration in the range of from 6 mM to 25 mM, for 
example from 6 to 10 mM, such as from 10 to 20 mM. 

5 49. The method of claim 1, wherein the synthesis is performed in the presence of 

dNTPs at a concentration in the range of from 1 ^M to 100 jiM, for example from 
50 nM to 100 \M t such as from 10 nM to 75 jiM, for example from 20 jaM to 50 
MM. 

10 50. The method of claim 1, wherein said first and said second template molecule 
share in the range of 10 to 99% identity. 

51. The method of claim 1, wherein a region comprising 5 nucleotides in said first 
template share at least 40%, such as at least 60%, for example at least 80% 

15 identity with a region comprising 5 nucleotides in the second template.. 

52. The method of claim 1 , wherein said first and said second template molecule 
share in the range of 50 to 95% identity. 

20 53. The method of claim 1 , wherein one or more templates have been prepared by a 
method comprising the steps of 

i) Providing a DNA molecule; and 

ii) Providing a mixture of nucleotides and nucleotide analogues, wherein 
25 one or more of said nucleotide analogues once incorporated into a 

nucleic acid do not allow further elongation; and 

iii) Contacting said DNA molecule with said mixture and transcribing and/or 
replicating said DNA; and 

iv) thereby obtaining one or more different templates. 



30 



54. The method of claim 1, wherein one or more templates have been prepared by a 
method comprising the steps of 



i) 



providing a RNA molecule; and 
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ii) fragmenting said RNA molecule into RNA fragments; and 

iii) replicating one or more of said RNA fragments; and 

iv) thereby obtaining one or more different templates 

5 55. The method of claim 53, wherein the nucleotide analogues are selected from the 
group of dexoyribonucleotide analogs 

56. The method of claim 53, wherein the nucleotide analogues are selected from the 
group of ribonucleotide analogs. 

10 

57. The method of claim 1 , wherein the method comprises amplification of the 
plurality of template switched products. 

58. The method of claim 57, wherein said amplification involves a PCR reaction. 

15 

59. The method of claim 1, wherein the sequence of steps are repeated at least 
once. 

60. The method of claim 59, wherein template switched product(s) are used as a 
20 first template and/or second template when the method is repeated. 



61 . Use of the method of any of claims 1 to 60 for gene shuffling. 
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